<140>
<141>

<150> 60/233,867
<160> 30

<170> PatentIn Ver. 2.0

<210> 1 

<211> 1209 

<212> DNA 

<213> Homo sapiens 

<220>
<221> CDS
<222> (33)..(854)

<220>
<221> sig_peptide
<222> (33)..(89)

<220>
<221> misc_feature
<222> (693)..(755)
<223> predicted transmembrane domain

<400> 1

cagaaagaga cctatatgat caaatacaga ac atg atc ttc ctc ctg cta atg 53
Met Ile Phe Leu Leu Leu Met
1 5

ttg agc ctg gaa ttg cag ctt cac cag ata gca gct tta ttc aca gtg 101
Leu Ser Leu Glu Leu Gln Leu His Gln Ile Ala Ala Leu Phe Thr Val
10 15 20

aca gtc cct aag gaa ctg tac ata ata gag cat ggc agc aat gtg acc 149
Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu His Gly Ser Asn Val Thr
25 30 35

ctg gaa tgc aac ttt gac act gga agt cat gtg aac ctt gga gca ata 197
Leu Glu Cys Asn Phe Asp Thr Gly Ser His Val Asn Leu Gly Ala Ile
40 45 50 55

aca gcc agt ttg caa aag gtg gaa aat gat aca tcc cca cac cgt gaa 245
Thr Ala Ser Leu Gln Lys Val Glu Asn Asp Thr Ser Pro His Arg Glu
60 65 70

aga gcc act ttg ctg gag gag cag ctg ccc cta ggg aag gcc tcg ttc 293
Arg Ala Thr Leu Leu Glu Glu Gln Leu Pro Leu Gly Lys Ala Ser Phe
75 80 85

cac ata cct caa gtc caa gtg agg gac gaa gga cag tac caa tgc ata 341
His Ile Pro Gln Val Gln Val Arg Asp Glu Gly Gln Tyr Gln Cys Ile
90 95 100

atc atc tat ggg gtc gcc tgg gac tac aag tac ctg act ctg aaa gtc 389
Ile Ile Tyr Gly Val Ala Trp Asp Tyr Lys Tyr Leu Thr Leu Lys Val
105 110 115

aaa gct tcc tac agg aaa ata aac act cac atc cta aag gtt cca gaa 437
Lys Ala Ser Tyr Arg Lys Ile Asn Thr His Ile Leu Lys Val Pro Glu
120 125 130 135

aca gat gag gta gag ctc acc tgc cag gct aca ggt tat cct ctg gca 485
Thr Asp Glu Val Glu Leu Thr Cys Gln Ala Thr Gly Tyr Pro Leu Ala
140 145 150

gaa gta tcc tgg cca aac gtc agc gtt cct gcc aac acc agc cac tcc 533
Glu Val Ser Trp Pro Asn Val Ser Val Pro Ala Asn Thr Ser His Ser
155 160 165

agg acc cct gaa ggc ctc tac cag gtc acc agt gtt ctg cgc cta aag 581
Arg Thr Pro Glu Gly Leu Tyr Gln Val Thr Ser Val Leu Arg Leu Lys
170 175 180

cca ccc cct ggc aga aac ttc agc tgt gtg ttc tgg aat act cac gtg 629
Pro Pro Pro Gly Arg Asn Phe Ser Cys Val Phe Trp Asn Thr His Val
185 190 195

agg gaa ctt act ttg gcc agc att gac ctt caa agt cag atg gaa ccc 677
Arg Glu Leu Thr Leu Ala Ser Ile Asp Leu Gln Ser Gln Met Glu Pro
200 205 210 215

agg acc cat cca act tgg ctg ctt cac att ttc atc ccc tcc tgc atc 725
Arg Thr His Pro Thr Trp Leu Leu His Ile Phe Ile Pro Ser Cys Ile
220 225 230

att gct ttc att ttc ata gcc aca gtg ata gcc cta aga aaa caa ctc 773
Ile Ala Phe Ile Phe Ile Ala Thr Val Ile Ala Leu Arg Lys Gln Leu
235 240 245

tgt caa aag ctg tat tct tca aaa gac aca aca aaa aga cct gtc acc 821
Cys Gln Lys Leu Tyr Ser Ser Lys Asp Thr Thr Lys Arg Pro Val Thr
250 255 260

aca aca aag agg gaa gtg aac agt gct atc tga acctgtggtc ttgggagcca 874
Thr Thr Lys Arg Glu Val Asn Ser Ala Ile
265 270

gggtgacctg atatgacatc taaagaagct tctggactct gaacaagaat tcggtggcct 934 






gcagagcttg ccatttgcac ttttcaaatg cctttggatg acccagcact ttaatctgaa 994 
acctgcaaca agactagcca acacctggcc atgaaacttg ccccttcact gatctggact 1054 
cacctctgga gcctatggct ttaagcaagc actactgcac tttacagaat taccccactg 1114 
gatcctggac ccacagaatt ccttcaggat ccttcttgct gccagactga aagcaaaagg 1174 
aattatttcc cctcaagttt tctaagtgat ttcca 1209 



<210> 2 
<211> 273 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ile Phe Leu Leu Leu Met Leu Ser Leu Glu Leu Gln Leu His Gln
1 5 10 15

Ile Ala Ala Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile
20 25 30

Glu His Gly Ser Asn Val Thr Leu Glu Cys Asn Phe Asp Thr Gly Ser
35 40 45

His Val Asn Leu Gly Ala Ile Thr Ala Ser Leu Gln Lys Val Glu Asn
50 55 60

Asp Thr Ser Pro His Arg Glu Arg Ala Thr Leu Leu Glu Glu Gln Leu
65 70 75 80

Pro Leu Gly Lys Ala Ser Phe His Ile Pro Gln Val Gln Val Arg Asp
85 90 95

Glu Gly Gln Tyr Gln Cys Ile Ile Ile Tyr Gly Val Ala Trp Asp Tyr
100 105 110

Lys Tyr Leu Thr Leu Lys Val Lys Ala Ser Tyr Arg Lys Ile Asn Thr
115 120 125

His Ile Leu Lys Val Pro Glu Thr Asp Glu Val Glu Leu Thr Cys Gln
130 135 140

Ala Thr Gly Tyr Pro Leu Ala Glu Val Ser Trp Pro Asn Val Ser Val
145 150 155 160

Pro Ala Asn Thr Ser His Ser Arg Thr Pro Glu Gly Leu Tyr Gln Val
165 170 175

Thr Ser Val Leu Arg Leu Lys Pro Pro Pro Gly Arg Asn Phe Ser Cys
180 185 190

Val Phe Trp Asn Thr His Val Arg Glu Leu Thr Leu Ala Ser Ile Asp
195 200 205

Leu Gln Ser Gln Met Glu Pro Arg Thr His Pro Thr Trp Leu Leu His
210 215 220

Ile Phe Ile Pro Ser Cys Ile Ile Ala Phe Ile Phe Ile Ala Thr Val
225 230 235 240

Ile Ala Leu Arg Lys Gln Leu Cys Gln Lys Leu Tyr Ser Ser Lys Asp
245 250 255

Thr Thr Lys Arg Pro Val Thr Thr Thr Lys Arg Glu Val Asn Ser Ala
260 265 270

Ile



<210> 3 
<211> 254 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> TRANSMEM
<222> (202)..(222)

<400> 3

Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu His Gly
1 5 10 15

Ser Asn Val Thr Leu Glu Cys Asn Phe Asp Thr Gly Ser His Val Asn
20 25 30

Leu Gly Ala Ile Thr Ala Ser Leu Gln Lys Val Glu Asn Asp Thr Ser
35 40 45

Pro His Arg Glu Arg Ala Thr Leu Leu Glu Glu Gln Leu Pro Leu Gly
50 55 60

Lys Ala Ser Phe His Ile Pro Gln Val Gln Val Arg Asp Glu Gly Gln
65 70 75 80

Tyr Gln Cys Ile Ile Ile Tyr Gly Val Ala Trp Asp Tyr Lys Tyr Leu
85 90 95

Thr Leu Lys Val Lys Ala Ser Tyr Arg Lys Ile Asn Thr His Ile Leu
100 105 110

Lys Val Pro Glu Thr Asp Glu Val Glu Leu Thr Cys Gln Ala Thr Gly
115 120 125

Tyr Pro Leu Ala Glu Val Ser Trp Pro Asn Val Ser Val Pro Ala Asn
130 135 140

Thr Ser His Ser Arg Thr Pro Glu Gly Leu Tyr Gln Val Thr Ser Val
145 150 155 160

Leu Arg Leu Lys Pro Pro Pro Gly Arg Asn Phe Ser Cys Val Phe Trp
165 170 175

Asn Thr His Val Arg Glu Leu Thr Leu Ala Ser Ile Asp Leu Gln Ser
180 185 190

Gln Met Glu Pro Arg Thr His Pro Thr Trp Leu Leu His Ile Phe Ile
195 200 205

Pro Ser Cys Ile Ile Ala Phe Ile Phe Ile Ala Thr Val Ile Ala Leu
210 215 220

Arg Lys Gln Leu Cys Gln Lys Leu Tyr Ser Ser Lys Asp Thr Thr Lys
225 230 235 240

Arg Pro Val Thr Thr Thr Lys Arg Glu Val Asn Ser Ala Ile
245 250



<210> 4 
<211> 224 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Gly His Thr Arg Arg Gln Gly Thr Ser Pro Ser Lys Cys Pro Tyr
1 5 10 15

Leu Asn Phe Phe Gln Leu Leu Val Leu Ala Gly Leu Ser His Phe Cys
20 25 30

Ser Gly Val Ile His Val Thr Lys Glu Val Lys Glu Val Ala Thr Leu
35 40 45

Ser Cys Gly His Asn Val Ser Val Glu Glu Leu Ala Gln Thr Arg Ile
50 55 60

Tyr Trp Gln Lys Glu Lys Lys Met Val Leu Thr Met Met Ser Gly Asp
65 70 75 80

Met Asn Ile Trp Pro Glu Tyr Lys Asn Arg Thr Ile Phe Asp Ile Thr
85 90 95

Asn Asn Leu Ser Ile Val Ile Leu Ala Leu Arg Pro Ser Asp Glu Gly
100 105 110

Thr Tyr Glu Cys Val Val Leu Lys Tyr Glu Lys Asp Ala Phe Lys Arg
115 120 125

Glu His Leu Ala Glu Val Thr Leu Ser Val Lys Ala Asp Phe Pro Thr
130 135 140

Pro Ser Ile Ser Asp Phe Glu Ile Pro Thr Ser Asn Ile Arg Arg Ile
145 150 155 160

Ile Cys Ser Thr Ser Gly Gly Phe Pro Glu Pro His Leu Ser Trp Leu
165 170 175

Glu Asn Gly Glu Glu Leu Asn Ala Ile Asn Thr Thr Val Ser Gln Asp
180 185 190

Pro Glu Thr Glu Leu Tyr Ala Val Ser Ser Lys Leu Asp Phe Asn Met
195 200 205

Thr Thr Asn His Ser Phe Met Cys Leu Ile Lys Tyr Gly His Leu Arg
210 215 220



<210> 5 
<211> 323 
<212> PRT 

<213> Homo sapiens 
<400> 5 

Met Gly Leu Ser Asn Ile Leu Phe Val Met Ala Phe Leu Leu Ser Gly
1 5 10 15

Ala Ala Pro Leu Lys Ile Gln Ala Tyr Phe Asn Glu Thr Ala Asp Leu
20 25 30

Pro Cys Gln Phe Ala Asn Ser Gln Asn Gln Ser Leu Ser Glu Leu Val
35 40 45

Val Phe Trp Gln Asp Gln Glu Asn Leu Val Leu Asn Glu Val Tyr Leu
50 55 60

Gly Lys Glu Lys Phe Asp Ser Val His Ser Lys Tyr Met Gly Arg Thr
65 70 75 80

Ser Phe Asp Ser Asp Ser Trp Thr Leu Arg Leu His Asn Leu Gln Ile
85 90 95

Lys Asp Lys Gly Leu Tyr Gln Cys Ile Ile His His Lys Lys Pro Thr
100 105 110

Gly Met Ile Arg Ile His Gln Met Asn Ser Glu Leu Ser Val Leu Ala
115 120 125

Asn Phe Ser Gln Pro Glu Ile Val Pro Ile Ser Asn Ile Thr Glu Asn
130 135 140

Val Tyr Ile Asn Leu Thr Cys Ser Ser Ile His Gly Tyr Pro Glu Pro
145 150 155 160

Lys Lys Met Ser Val Leu Leu Arg Thr Lys Asn Ser Thr Ile Glu Tyr
165 170 175

Asp Gly Ile Met Gln Lys Ser Gln Asp Asn Val Thr Glu Leu Tyr Asp
180 185 190

Val Ser Ile Ser Leu Ser Val Ser Phe Pro Asp Val Thr Ser Asn Met
195 200 205

Thr Ile Phe Cys Ile Leu Glu Thr Asp Lys Thr Arg Leu Leu Ser Ser
210 215 220

Pro Phe Ser Ile Glu Leu Glu Asp Pro Gln Pro Pro Pro Asp His Ile
225 230 235 240

Pro Trp Ile Thr Ala Val Leu Pro Thr Val Ile Ile Cys Val Met Val
245 250 255

Phe Cys Leu Ile Leu Trp Lys Trp Lys Lys Lys Lys Arg Pro Arg Asn
260 265 270

Ser Tyr Lys Cys Gly Thr Asn Thr Met Glu Arg Glu Glu Ser Glu Gln
275 280 285

Thr Lys Lys Arg Glu Lys Ile His Ile Pro Glu Arg Ser Asp Glu Ala
290 295 300

Gln Arg Val Phe Lys Ser Ser Lys Thr Ser Ser Cys Asp Lys Ser Asp
305 310 315 320

Thr Cys Phe



<210> 6 
<211> 290 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Arg Ile Phe Ala Val Phe Ile Phe Met Thr Tyr Trp His Leu Leu
1 5 10 15

Asn Ala Phe Thr Val Thr Val Pro Asp Lys Leu Tyr Val Val Glu Tyr
20 25 30

Gly Ser Asn Met Thr Ile Glu Cys Lys Phe Pro Val Glu Lys Gln Leu
35 40 45

Asp Leu Ala Ala Leu Ile Val Tyr Trp Glu Met Glu Asp Lys Asn Ile
50 55 60

Ile Gln Phe Val His Gly Glu Glu Asp Leu Lys Val Gln His Ser Ser
65 70 75 80

Tyr Arg Gln Arg Ala Arg Leu Leu Lys Asp Gln Leu Ser Leu Gly Asn
85 90 95

Ala Ala Leu Gln Ile Thr Asp Val Lys Leu Gln Asp Ala Gly Val Tyr
100 105 110

Arg Cys Met Ile Ser Tyr Gly Gly Ala Asp Tyr Lys Arg Ile Thr Val
115 120 125

Lys Val Asn Ala Pro Tyr Asn Lys Ile Asn Gln Arg Ile Leu Val Val
130 135 140



Asp Pro Val Thr Ser Glu His Glu Leu Thr Cys Gln Ala Glu Gly Tyr
145 150 155 160

Pro Lys Ala Glu Val Ile Trp Thr Ser Ser Asp His Gln Val Leu Ser
165 170 175

Gly Lys Thr Thr Thr Thr Asn Ser Lys Arg Glu Glu Lys Leu Phe Asn
180 185 190

Val Thr Ser Thr Leu Arg Ile Asn Thr Thr Thr Asn Glu Ile Phe Tyr
195 200 205

Cys Thr Phe Arg Arg Leu Asp Pro Glu Glu Asn His Thr Ala Glu Leu
210 215 220

Val Ile Pro Glu Leu Pro Leu Ala His Pro Pro Asn Glu Arg Thr His
225 230 235 240

Leu Val Ile Leu Gly Ala Ile Leu Leu Cys Leu Gly Val Ala Leu Thr
245 250 255

Phe Ile Phe Arg Leu Arg Lys Gly Arg Met Met Asp Val Lys Lys Cys
260 265 270

Gly Ile Gln Asp Thr Asn Ser Lys Lys Gln Ser Asp Thr His Leu Glu
275 280 285

Glu Thr
290



<210> 7
<211> 302
<212> PRT

<213> Homo sapiens
<400> 7

Met Arg Leu Gly Ser Pro Gly Leu Leu Phe Leu Leu Phe Ser Ser Leu
1 5 10 15

Arg Ala Asp Thr Gln Glu Lys Glu Val Arg Ala Met Val Gly Ser Asp
20 25 30

Val Glu Leu Ser Cys Ala Cys Pro Glu Gly Ser Arg Phe Asp Leu Asn
35 40 45

Asp Val Tyr Val Tyr Trp Gln Thr Ser Glu Ser Lys Thr Val Val Thr
50 55 60

Tyr His Ile Pro Gln Asn Ser Ser Leu Glu Asn Val Asp Ser Arg Tyr
65 70 75 80

Arg Asn Arg Ala Leu Met Ser Pro Ala Gly Met Leu Arg Gly Asp Phe
85 90 95

Ser Leu Arg Leu Phe Asn Val Thr Pro Gln Asp Glu Gln Lys Phe His
100 105 110



Cys Leu Val Leu Ser Gln Ser Leu Gly Phe Gln Glu Val Leu Ser Val
115 120 125

Glu Val Thr Leu His Val Ala Ala Asn Phe Ser Val Pro Val Val Ser
130 135 140

Ala Pro His Ser Pro Ser Gln Asp Glu Leu Thr Phe Thr Cys Thr Ser
145 150 155 160

Ile Asn Gly Tyr Pro Arg Pro Asn Val Tyr Trp Ile Asn Lys Thr Asp
165 170 175

Asn Ser Leu Leu Asp Gln Ala Leu Gln Asn Asp Thr Val Phe Leu Asn
180 185 190

Met Arg Gly Leu Tyr Asp Val Val Ser Val Leu Arg Ile Ala Arg Thr
195 200 205

Pro Ser Val Asn Ile Gly Cys Cys Ile Glu Asn Val Leu Leu Gln Gln
210 215 220

Asn Leu Thr Val Gly Ser Gln Thr Gly Asn Asp Ile Gly Glu Arg Asp
225 230 235 240

Lys Ile Thr Glu Asn Pro Val Ser Thr Gly Glu Lys Asn Ala Ala Thr
245 250 255

Trp Ser Ile Leu Ala Val Leu Cys Leu Leu Val Val Val Ala Val Ala
260 265 270

Ile Gly Trp Val Cys Arg Asp Arg Cys Leu Gln His Ser Tyr Ala Gly
275 280 285

Ala Trp Ala Val Ser Pro Glu Thr Glu Leu Thr Gly His Val
290 295 300



<210> 8 
<211> 316 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> (233) 

<223> "Xaa" can be any naturally-occurring amino acid 
<400> 8 

Met Leu Arg Arg Arg Gly Ser Pro Gly Met Gly Val His Val Gly Ala
1 5 10 15

Ala Leu Gly Ala Leu Trp Phe Cys Leu Thr Gly Ala Leu Glu Val Gln
20 25 30

Val Pro Glu Asp Pro Val Val Ala Leu Val Gly Thr Asp Ala Thr Leu
35 40 45

Cys Cys Ser Phe Ser Pro Glu Pro Gly Phe Ser Leu Ala Gln Leu Asn
50 55 60

Leu Ile Trp Gln Leu Thr Asp Thr Lys Gln Leu Val His Ser Phe Ala
65 70 75 80

Glu Gly Gln Asp Gln Gly Ser Ala Tyr Ala Asn Arg Thr Ala Leu Phe
85 90 95

Pro Asp Leu Leu Ala Gln Gly Asn Ala Ser Leu Arg Leu Gln Arg Val
100 105 110

Arg Val Ala Asp Glu Gly Ser Phe Thr Cys Phe Val Ser Ile Arg Asp
115 120 125

Phe Gly Ser Ala Ala Val Ser Leu Gln Val Ala Ala Pro Tyr Ser Lys
130 135 140

Pro Ser Met Thr Leu Glu Pro Asn Lys Asp Leu Arg Pro Gly Asp Thr
145 150 155 160

Val Thr Ile Thr Cys Ser Ser Tyr Gln Gly Tyr Pro Glu Ala Glu Val
165 170 175

Phe Trp Gln Asp Gly Gln Gly Val Pro Leu Thr Gly Asn Val Thr Thr
180 185 190

Ser Gln Met Ala Asn Glu Gln Gly Leu Phe Asp Val His Ser Val Leu
195 200 205

Arg Val Val Leu Gly Ala Asn Gly Thr Tyr Ser Cys Leu Val Arg Asn
210 215 220

Pro Val Leu Gln Gln Asp Ala His Xaa Ser Val Thr Ile Thr Gly Gln
225 230 235 240

Pro Met Thr Phe Pro Pro Glu Ala Leu Trp Val Thr Val Gly Leu Ser
245 250 255

Val Cys Leu Ile Ala Leu Leu Val Ala Leu Ala Phe Val Cys Trp Arg
260 265 270

Lys Ile Lys Gln Ser Cys Glu Glu Glu Asn Ala Gly Ala Glu Asp Gln
275 280 285

Asp Gly Glu Gly Glu Gly Ser Lys Thr Ala Leu Gln Pro Leu Lys His
290 295 300

Ser Asp Ser Lys Glu Asp Asp Gly Gln Glu Ile Ala
305 310 315



<210> 9 
<211> 276 
<212> PRT 

<213> Homo sapiens 






<400> 9 

Met Glu Ser Ala Ala Ala Leu His Phe Ser Arg Pro Ala Ser Leu Leu
1 5 10 15

Leu Leu Leu Leu Ser Leu Cys Ala Leu Val Ser Ala Gln Phe Ile Val
20 25 30

Val Gly Pro Thr Asp Pro Ile Leu Ala Thr Val Gly Glu Asn Thr Thr
35 40 45

Leu Arg Cys His Leu Ser Pro Glu Lys Asn Ala Glu Asp Met Glu Val
50 55 60

Arg Trp Phe Arg Ser Gln Phe Ser Pro Ala Val Phe Val Tyr Lys Gly
65 70 75 80

Gly Arg Glu Arg Thr Glu Glu Gln Met Glu Glu Tyr Arg Gly Arg Thr
85 90 95

Thr Phe Val Ser Lys Asp Ile Ser Arg Gly Ser Val Ala Leu Val Ile
100 105 110

His Asn Ile Thr Ala Gln Glu Asn Gly Thr Tyr Arg Cys Tyr Phe Gln
115 120 125

Glu Gly Arg Ser Tyr Asp Glu Ala Ile Leu His Leu Val Val Ala Gly
130 135 140

Leu Gly Ser Lys Pro Leu Ile Ser Met Arg Gly His Glu Asp Gly Gly
145 150 155 160

Ile Arg Leu Glu Cys Ile Ser Arg Gly Trp Tyr Pro Lys Pro Leu Thr
165 170 175

Val Trp Arg Asp Pro Tyr Gly Gly Val Ala Pro Ala Leu Lys Glu Val
180 185 190

Ser Met Pro Asp Ala Asp Gly Leu Phe Met Val Thr Thr Ala Val Ile
195 200 205

Ile Arg Asp Lys Ser Val Arg Asn Met Ser Cys Ser Ile Asn Asn Thr
210 215 220

Leu Leu Gly Gln Lys Lys Glu Ser Val Ile Phe Ile Pro Glu Ser Phe
225 230 235 240

Met Pro Ser Val Ser Pro Cys Ala Val Ala Leu Pro Ile Ile Val Val
245 250 255

Ile Leu Met Ile Pro Ile Ala Val Cys Ile Tyr Trp Ile Asn Lys Leu
260 265 270

Gln Lys Glu Lys
275






<210> 10 
<211> 523 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Glu Pro Ala Ala Ala Leu His Phe Ser Leu Pro Ala Ser Leu Leu
1 5 10 15

Leu Leu Leu Leu Leu Leu Leu Leu Ser Leu Cys Ala Leu Val Ser Ala
20 25 30

Gln Phe Thr Val Val Gly Pro Ala Asn Pro Ile Leu Ala Met Val Gly
35 40 45

Glu Asn Thr Thr Leu Arg Cys His Leu Ser Pro Glu Lys Asn Ala Glu
50 55 60

Asp Met Glu Val Arg Trp Phe Arg Ser Gln Phe Ser Pro Ala Val Phe
65 70 75 80

Val Tyr Lys Gly Gly Arg Glu Arg Thr Glu Glu Gln Met Glu Glu Tyr
85 90 95

Arg Gly Arg Ile Thr Phe Val Ser Lys Asp Ile Asn Arg Gly Ser Val
100 105 110

Ala Leu Val Ile His Asn Val Thr Ala Gln Glu Asn Gly Ile Tyr Arg
115 120 125

Cys Tyr Phe Gln Glu Gly Arg Ser Tyr Asp Glu Ala Ile Leu Arg Leu
130 135 140

Val Val Ala Gly Leu Gly Ser Lys Pro Leu Ile Glu Ile Lys Ala Gln
145 150 155 160

Glu Asp Gly Ser Ile Trp Leu Glu Cys Ile Ser Gly Gly Trp Tyr Pro
165 170 175

Glu Pro Leu Thr Val Trp Arg Asp Pro Tyr Gly Glu Val Val Pro Ala
180 185 190

Leu Lys Glu Val Ser Ile Ala Asp Ala Asp Gly Leu Phe Met Val Thr
195 200 205

Thr Ala Val Ile Ile Arg Asp Lys Tyr Val Arg Asn Val Ser Cys Ser
210 215 220

Val Asn Asn Thr Leu Leu Gly Gln Glu Lys Glu Thr Val Ile Phe Ile
225 230 235 240

Pro Glu Ser Phe Met Pro Ser Ala Ser Pro Trp Met Val Ala Leu Ala
245 250 255

Val Ile Leu Thr Ala Ser Pro Trp Met Val Ser Met Thr Val Ile Leu
260 265 270

Ala Val Phe Ile Ile Phe Met Ala Val Ser Ile Cys Cys Ile Lys Lys
275 280 285

Leu Gln Arg Glu Lys Lys Ile Leu Ser Gly Glu Lys Lys Val Glu Gln
290 295 300

Glu Glu Lys Glu Ile Ala Gln Gln Leu Gln Glu Glu Leu Arg Trp Arg
305 310 315 320

Arg Thr Phe Leu His Ala Ala Asp Val Val Leu Asp Pro Asp Thr Ala
325 330 335

His Pro Glu Leu Phe Leu Ser Glu Asp Arg Arg Ser Val Arg Arg Gly
340 345 350

Pro Tyr Arg Gln Arg Val Pro Asp Asn Pro Glu Arg Phe Asp Ser Gln
355 360 365

Pro Cys Val Leu Gly Trp Glu Ser Phe Ala Ser Gly Lys His Tyr Trp
370 375 380

Glu Val Glu Val Glu Asn Val Met Val Trp Thr Val Gly Val Cys Arg
385 390 395 400

His Ser Val Glu Arg Lys Gly Glu Val Leu Leu Ile Pro Gln Asn Gly
405 410 415

Phe Trp Thr Leu Glu Met Phe Gly Asn Gln Tyr Arg Ala Leu Ser Ser
420 425 430

Pro Glu Arg Ile Leu Pro Leu Lys Glu Ser Leu Cys Arg Val Gly Val
435 440 445

Phe Leu Asp Tyr Glu Ala Gly Asp Val Ser Phe Tyr Asn Met Arg Asp
450 455 460

Arg Ser His Ile Tyr Thr Cys Pro Arg Ser Ala Phe Thr Val Pro Val
465 470 475 480

Arg Pro Phe Phe Arg Leu Gly Ser Asp Asp Ser Pro Ile Phe Ile Cys
485 490 495

Pro Ala Leu Thr Gly Ala Ser Gly Val Met Val Pro Glu Glu Gly Leu
500 505 510

Lys Leu His Arg Val Gly Thr His Gln Ser Leu
515 520



<210> 11 
<211> 263 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Phe His Val Ser Leu Leu Leu Val Gln Leu Leu Thr Pro Cys Ser Ala
1 5 10 15

Gln Phe Ser Val Leu Gly Pro Ser Gly Pro Ile Leu Ala Met Val Gly
20 25 30

Glu Asp Ala Asp Leu Pro Cys His Leu Phe Pro Thr Met Ser Ala Glu
35 40 45

Thr Met Glu Leu Lys Trp Val Ser Ser Ser Leu Arg Gln Val Val Asn
50 55 60

Val Tyr Ala Asp Gly Lys Glu Val Glu Asp Arg Gln Ser Ala Pro Tyr
65 70 75 80

Arg Gly Arg Thr Ser Ile Leu Arg Asp Gly Ile Thr Ala Gly Lys Ala
85 90 95

Ala Leu Arg Ile His Asn Val Thr Ala Ser Asp Ser Gly Lys Tyr Leu
100 105 110

Cys Tyr Phe Gln Asp Gly Asp Phe Tyr Glu Lys Ala Leu Val Glu Leu
115 120 125

Lys Val Ala Ala Leu Gly Ser Asn Leu His Val Glu Val Lys Gly Tyr
130 135 140

Glu Asp Gly Gly Ile His Leu Glu Cys Arg Ser Thr Gly Trp Tyr Pro
145 150 155 160

Gln Pro Gln Ile Gln Trp Ser Asn Ala Lys Gly Glu Asn Ile Pro Ala
165 170 175

Val Glu Ala Pro Val Val Ala Asp Gly Val Gly Leu Tyr Glu Val Ala
180 185 190

Ala Ser Val Ile Met Arg Gly Gly Ser Gly Glu Gly Val Ser Cys Ile
195 200 205

Ile Arg Asn Ser Leu Leu Gly Leu Glu Lys Thr Ala Ser Ile Ser Ile
210 215 220

Ala Asp Pro Phe Phe Arg Ser Ala Gln Pro Trp Ile Ala Ala Leu Ala
225 230 235 240

Gly Thr Leu Pro Ile Leu Leu Leu Leu Leu Ala Gly Ala Ser Tyr Phe
245 250 255

Leu Trp Arg Gln Gln Lys Glu
260



<210> 12 
<211> 584 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Lys Met Ala Ser Ser Leu Ala Phe Leu Leu Leu Asn Phe His Val
1 5 10 15

Ser Leu Phe Leu Val Gln Leu Leu Thr Pro Cys Ser Ala Gln Phe Ser
20 25 30

Val Leu Gly Pro Ser Gly Pro Ile Leu Ala Met Val Gly Glu Asp Ala
35 40 45

Asp Leu Pro Cys His Leu Phe Pro Thr Met Ser Ala Glu Thr Met Glu
50 55 60

Leu Arg Trp Val Ser Ser Ser Leu Arg Gln Val Val Asn Val Tyr Ala
65 70 75 80

Asp Gly Lys Glu Val Glu Asp Arg Gln Ser Ala Pro Tyr Arg Gly Arg
85 90 95

Thr Ser Ile Leu Arg Asp Gly Ile Thr Ala Gly Lys Ala Ala Leu Arg
100 105 110

Ile His Asn Val Thr Ala Ser Asp Ser Gly Lys Tyr Leu Cys Tyr Phe
115 120 125

Gln Asp Gly Asp Phe Tyr Glu Lys Ala Leu Val Glu Leu Lys Val Ala
130 135 140

Ala Leu Gly Ser Asp Leu His Ile Glu Val Lys Gly Tyr Glu Asp Gly
145 150 155 160

Gly Ile His Leu Glu Cys Arg Ser Thr Gly Trp Tyr Pro Gln Pro Gln
165 170 175

Ile Lys Trp Ser Asp Thr Lys Gly Glu Asn Ile Pro Ala Val Glu Ala
180 185 190

Pro Val Val Ala Asp Gly Val Gly Leu Tyr Ala Val Ala Ala Ser Val
195 200 205

Ile Met Arg Gly Ser Ser Gly Gly Gly Val Ser Cys Ile Ile Arg Asn
210 215 220

Ser Leu Leu Gly Leu Glu Lys Thr Ala Ser Ile Ser Ile Ala Asp Pro
225 230 235 240

Phe Phe Arg Ser Ala Gln Pro Trp Ile Ala Ala Leu Ala Gly Thr Leu
245 250 255

Pro Ile Ser Leu Leu Leu Leu Ala Gly Ala Ser Tyr Phe Leu Trp Arg
260 265 270

Gln Gln Lys Glu Lys Ile Ala Leu Ser Arg Glu Thr Glu Arg Glu Arg
275 280 285

Glu Met Lys Glu Met Gly Tyr Ala Ala Thr Glu Gln Glu Ile Ser Leu
290 295 300

Arg Glu Lys Leu Gln Glu Glu Leu Lys Trp Arg Lys Ile Gln Tyr Met
305 310 315 320

Ala Arg Gly Glu Lys Ser Leu Ala Tyr His Glu Trp Lys Met Ala Leu
325 330 335

Phe Lys Pro Ala Asp Val Ile Leu Asp Pro Asp Thr Ala Asn Ala Ile
340 345 350

Leu Leu Val Ser Glu Asp Gln Arg Ser Val Gln Arg Ala Glu Glu Pro
355 360 365

Arg Asp Leu Pro Asp Asn Pro Glu Arg Phe Glu Trp Arg Tyr Cys Val
370 375 380

Leu Gly Cys Glu Asn Phe Thr Ser Gly Arg His Tyr Trp Glu Val Glu
385 390 395 400

Val Gly Asp Arg Lys Glu Trp His Ile Gly Val Cys Ser Lys Asn Val
405 410 415

Glu Arg Lys Lys Gly Trp Val Lys Met Thr Pro Glu Asn Gly Tyr Trp
420 425 430

Thr Met Gly Leu Thr Asp Gly Asn Lys Tyr Arg Ala Leu Thr Glu Pro
435 440 445

Arg Thr Asn Leu Lys Leu Pro Glu Pro Pro Arg Lys Val Gly Ile Phe
450 455 460

Leu Asp Tyr Glu Thr Gly Glu Ile Ser Phe Tyr Asn Ala Thr Asp Gly
465 470 475 480

Ser His Ile Tyr Thr Phe Pro His Ala Ser Phe Ser Glu Pro Leu Tyr
485 490 495

Pro Val Phe Arg Ile Leu Thr Leu Glu Pro Thr Ala Leu Thr Ile Cys
500 505 510

Pro Ile Pro Lys Glu Val Glu Ser Ser Pro Asp Pro Asp Leu Val Pro
515 520 525

Asp His Ser Leu Glu Thr Pro Leu Thr Pro Gly Leu Ala Asn Glu Ser
530 535 540

Gly Glu Pro Gln Ala Glu Val Thr Ser Leu Leu Leu Pro Ala His Pro
545 550 555 560

Gly Ala Glu Val Ser Pro Ser Ala Thr Thr Asn Gln Asn His Lys Leu
565 570 575

Gln Ala Arg Thr Glu Ala Leu Tyr
580



<210> 13 
<211> 526 
<212> PRT 
<213> Homo sapiens



<400> 13 

Met Ala Val Phe Pro Ser Ser Gly Leu Pro Arg Cys Leu Leu Thr Leu
1 5 10 15

Ile Leu Leu Gln Leu Pro Lys Leu Asp Ser Ala Pro Phe Asp Val Ile
20 25 30

Gly Pro Pro Glu Pro Ile Leu Ala Val Val Gly Glu Asp Ala Glu Leu
35 40 45

Pro Cys Arg Leu Ser Pro Asn Ala Ser Ala Glu His Leu Glu Leu Arg
50 55 60

Trp Phe Arg Lys Lys Val Ser Pro Ala Val Leu Val His Arg Asp Gly
65 70 75 80

Arg Glu Gln Glu Ala Glu Gln Met Pro Glu Tyr Arg Gly Arg Ala Thr
85 90 95

Leu Val Gln Asp Gly Ile Ala Lys Gly Arg Val Ala Leu Arg Ile Arg
100 105 110

Gly Val Arg Val Ser Asp Asp Gly Glu Tyr Thr Cys Phe Phe Arg Glu
115 120 125

Asp Gly Ser Tyr Glu Glu Ala Leu Val His Leu Lys Val Ala Ala Leu
130 135 140

Gly Ser Asp Pro His Ile Ser Met Gln Val Gln Glu Asn Gly Glu Ile
145 150 155 160

Cys Leu Glu Cys Thr Ser Val Gly Trp Tyr Pro Glu Pro Gln Val Gln
165 170 175

Trp Arg Thr Ser Lys Gly Glu Lys Phe Pro Ser Thr Ser Glu Ser Arg
180 185 190

Asn Pro Asp Glu Glu Gly Leu Phe Thr Val Ala Ala Ser Val Ile Ile
195 200 205

Arg Asp Thr Ser Thr Lys Asn Val Ser Cys Tyr Ile Gln Asn Leu Leu
210 215 220

Leu Gly Gln Glu Lys Lys Val Glu Ile Ser Ile Pro Ala Ser Ser Leu
225 230 235 240

Pro Arg Leu Thr Pro Trp Ile Val Ala Val Ala Val Ile Leu Met Val
245 250 255

Leu Gly Leu Leu Thr Ile Gly Ser Ile Phe Phe Thr Trp Arg Leu Tyr
260 265 270

Asn Glu Arg Pro Arg Glu Arg Arg Asn Glu Phe Ser Ser Lys Glu Arg
275 280 285

Leu Leu Glu Glu Leu Lys Trp Lys Lys Ala Thr Leu His Ala Val Asp
290 295 300

Val Thr Leu Asp Pro Asp Thr Ala His Pro His Leu Phe Leu Tyr Glu
305 310 315 320

Asp Ser Lys Ser Val Arg Leu Glu Asp Ser Arg Gln Lys Leu Pro Glu
325 330 335

Lys Thr Glu Arg Phe Asp Ser Trp Pro Cys Val Leu Gly Arg Glu Thr
340 345 350

Phe Thr Ser Gly Arg His Tyr Trp Glu Val Glu Val Gly Asp Arg Thr
355 360 365

Asp Trp Ala Ile Gly Val Cys Arg Glu Asn Val Met Lys Lys Gly Phe
370 375 380

Asp Pro Met Thr Pro Glu Asn Gly Phe Trp Ala Val Glu Leu Tyr Gly
385 390 395 400

Asn Gly Tyr Trp Ala Leu Thr Pro Leu Arg Thr Pro Leu Pro Leu Ala
405 410 415

Gly Pro Pro Arg Arg Val Gly Ile Phe Leu Asp Tyr Glu Ser Gly Asp
420 425 430

Ile Ser Phe Tyr Asn Met Asn Asp Gly Ser Asp Ile Tyr Thr Phe Ser
435 440 445

Asn Val Thr Phe Ser Gly Pro Leu Arg Pro Phe Phe Cys Leu Trp Ser
450 455 460

Ser Gly Lys Lys Pro Leu Thr Ile Cys Pro Ile Ala Asp Gly Pro Glu
465 470 475 480

Arg Val Thr Val Ile Ala Asn Ala Gln Asp Leu Ser Lys Glu Ile Pro
485 490 495

Leu Ser Pro Met Gly Glu Glu Ser Ala Pro Arg Asp Ala Asp Thr Leu
500 505 510

His Ser Lys Leu Ile Pro Thr Gln Pro Ser Gln Gly Ala Pro
515 520 525



<210> 14 

<211> 7819 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon
<222> (4599)..(4652)
<223> coding portion of exon 1

<220>
<221> unsure
<222> (145)..(292)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (1977)..(2459)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (2465)..(2576)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (2593)..(3410)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (3490)..(3533)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (6111)..(6194)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (6836)..(6870)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (7091)..(7112)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (7118)..(7420)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (7554)..(7575)
<223> "N" can be A, C, T, or G

<220>
<221> unsure
<222> (7694)..(7718)
<223> "N" can be A, C, T, or G

<400> 14

ataagaagct gaattaaggt gatggcagtg gggtggaaga aaggagagcc accatgcaaa 60
aagtatccag gagggagaat taacaggact aggggatggg ccatatttgc aagatgagaa 120
atgcagaggt ctaagattct agctnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nngcataaac 300
catattttcc ccagaggagg attagtagga aaggaagctg ctggttggaa agtatcttta 360
tagcagtgtc tgttcctcgg tttgctcaag gggacagtgt gccaggaaag tccccgtgga 420
agggcaagga agaaggggaa gttaaagcca gtggcaggtg atccaagaat cttttctgtt 480
gctagagcta tgttacatgc tgtcctttca tgctctaaaa ataagagtgc tggcaagtgc 540
caggcctgtt ggtgcagctt aagatgatac ctttcttgga tatatatgca tctgaataag 600
gaaggctatc ttctggtcaa gctaaggtat gccatgagca tttccctgtg gaaagcactt 660
aattctgttc ccagttgtta cctgctgtaa gatctccctt tctaaaataa aaacaagaat 720
acagctcact gaggacctta catttccctc tagctactga ctcatttctc ttctcctttt 780
tatagcactc ttcttgagag agttgcctat atttgttgcc acatctttac ccattctctt 840
ttgaacctat tcaagctttc atctgtacaa aactcactga tactgtgctt gtcaggatca 900
tccatgacct ccatactgct aaatgcaact ctcaagagta tttggctcta ctgatcactc 960
ctttgtagca ctgtgtttta aaatataggt tttattatta tttaggtatg gtgaggccaa 1020
tatatcagga aatgactgtc gttgaaaaaa gtatgttgta ctcacagatc ccaagagaag 1080
gggggcacac catgccacaa agggccacat ggggaagcac cagggtcagc caggaggtgg 1140
gtggggggtg cgcaagatct ttattgtggt ttcaacagga agaaatgggt gaagcagggt 1200
gagtggattt aggattagct gatataaata atttcagcag gctctggggc ataggggctg 1260
tccctagtct tctggtactt ggccctgggg tgattaaggc agttgcatag tgttgggaat 1320
gtgaaagccc ccaataaatg aggcagttgt gggtatgggc tctgaaatgg gttggtttgc 1380
atttgaaagg tgtgctcatg ggcaagtggt ttactctctc ttagaggtta gaattggcta 1440
accctgggag cggcagtccc ttcagggtca gcaaggcccc aggtgtcaaa gcatcagaat 1500
acagaaaata aaatgcatgg ataatacaca ctgccatttg cctttgtacc cttcctttca 1560
atcttctctg ctggtgaccg ctcttcacaa agatctataa atgttggaat accccatgtc 1620
tcagtccttg ggcactctct ttcctatctc tctgtaggtg atgtaatgca gatatccatg 1680
actttaaatc tttaacactt ctgcattgat gactcctaaa tttacatctc taccccaact 1740
gcctactaaa cacctccact tggctatcta ataggcattt caaaccaaat ctacaacaaa 1800
cgtaactctt tttccccttc cttaatttgc ttctccccca gccttctcca ttttaataaa 1860
cagcatctcc attgccttag tgactcaagc cccaaactta ggaattttcc cagatttccc 1920
tctttttctc aaactatata tctagcctgt cagcagttcc cttcaggtct tttttcnnnn 1980
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2040
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2280
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2340
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2400
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna 2460
ctaannnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2520
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngctc 2580
catttatatt tannnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2640
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2700
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2760
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2820
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2880
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2940
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3000
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3060
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3120
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3240
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3300
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3360
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ttgtgtttta 3420
aatatatata cacacttaga cacatataac cctctttcgt atatcaatta tactttaata 3480
aagctgttgn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnctcgcca 3540
acagaccact tctcacccct actagcccct ctactctaag ctatcagcat ttctgcaaac 3600
acattcctaa ccgatctcac tgcttgtaat cttgccagca acctctccct ctcagcaata 3660
gtctattgcc tacaccaaag cttagttgtc tcttaatgat gtaaatgagg ttctatcatt 3720
ctcctgaccc aaaccctcca ctgcttttca tcacactcag agcagctctg ctgttgcctg 3780
atttagatgt atggctccaa cagatttccc ctgaagaaat gattccatgg ctgataaaag 3840
ttggaaagcc tcctcagttt cagaccatta tcagattagc tgtgtgctct gtccctttcc 3900
tcaaccataa gaagtccatg gataaagaaa gcttcagagt aaaggagaaa gcatgggagg 3960
tacagcagga ccaaggtggg gcattcgcag cccccaccct catcagagcc agttccctac 4020
tctccctgtc taaacctctt agtaagaggt agttcaagag aggggcaaac tcaattccag 4080
cactcaaaag cacttgacta ctttgctcag tcaactagca agtatttatt gagaatgtag 4140
ctctgttcta tggagtctta ttttcaagtg tcagactccc agacatccag tccaggtaaa 4200
gaagatggtg tccattattc atttgacaaa caaagttggg gttcaagggc cagctattga 4260
aaaaagctat ggaaagcttc atgagacgtg caggtaactg ccaatatgtg tggttcacaa 4320
ggactggttc atattcagaa acggccatta gaaaaggaag aagaacttct catttggatt 4380
tataaagagt gtcttgttta ctcttaattt atatcttctc ttctccagga aatcaaccta 4440
taacttctcc tcccagctcc actctaccat ggtctgtcac cttccccaaa tgatttgtta 4500
ttcccctgtt ttcaaaagtg aacaaagaac caaagaccca gcaaagtttc acaaggccct 4560
gagactttca attgtctatt tcagatcaaa tacagaac atg atc ttc ctc ctg cta 4616
atg ttg agc ctg gaa ttg cag ctt cac cag ata gca ggtaagaaag 4662
gacaaaggga gaggcttaag aaagaagagc aggtggtggt tcctagccaa agccaaaaat 4722
gagaatgtgg ccctcaggct gagggctttc tttgagagga cgtatgattt ctgggctatt 4782
ccaagcacca caaaaaaaaa aagagtcccc atggtggctt atacatgcca atgtccctat 4842
ctgacagaaa cggtgactga gaatattgct ccatctattc ccactatcca gtgagggtaa 4902
tgacaagaag acaggatcac tcagaccatg taaatctaaa ctgatacaag agggcagggg 4962
ttgagttccc ttaaaggtga gatgccaagc agctgtcccc ttcctttctg gcagggagag 5022
taaggagaca atggccaggg aacaccgtta ctctaaagat aatgtcttga agacattctg 5082
catattatta gttgtttctg tgagtttctt ttttgaaaag caacaatagc agccgttggt 5142
cattcatacc ttaatgtggt ttactgagtc ttcctaaaac ccaaatgaac aatgaacctt 5202
aaggctatcc


ctttggactt 


gaagaaagga 


gaaaagcagt 


ttcacagttg 


gttgttctcc 


gtagttagaa 


ccatgagtgc 


actattttgg 


gaggtgtgga 


cagaggatta 


agtcctcaag 


tttctgctca 


tgattgagaa 


ataatggctc 


ttctcctccc 


tactagctgg 


ttccagatca 


cttgagctgc 


ccagcctggt 


ctggcactag 


agaccctgag 


atattctgct 


tacttctact 


acttttcatg 


ttttctttgc 


aaggaaccct 


ggggcaataa 


cttccttgct 


acaaaacagc 


gagggccaaa 


agacactgtt 


ccccatcctg 


tggggaatgt 


gtaacctctg 


tgggtttgta 


gtctttgttt 


tttgcctttt 


ctcttaggaa 


ttttttatcc 


ctcattggct 


ctttctacag 


cacaagacaa 


tgaagatagg 


cactagctcc 


gagcatttnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


gaaaaggtac 


ttaatttccc 


caagattaca 


acaaagctaa 


tgtcctttct 


actttactgt 


agacaaccca 


taaatcttcc 


aactttggaa 


tacgtattga 


atgtttattc 


tggatattca 


gttgctgaat 


taactgaaag 


gaaagtaaaa 


ttggaattct 


tttctgcttt 


ccactatgca 


cctacctaaa 


ttagaatgcc 


ttctcctcat 




taaagagttt


tgatttcaga 


aaactaaagg 


atctgagaaa 


taattagtgt 


tgggggtaaa 


aaggatagtc 


ttatctatta 


agcatgtatt 


aggnnnnnnn 


nnnnnnnnnn 



cttctattgg 


aggatgaggg 


tgagcagaaa 


5262 


tggggaaggt 


agttcagacc 


attcgagggt 


5322 


atgaacacca 


ggagctaaga 


gagtaacata 


5382 


acaatagccc 


cagccccatg 


ggaaatcatc 


5442 


ccttggcact 


tgataacctt 


tcgaagagct 


5502 


ctcttcaccc 


agtcacattc 


ctctcactca 


5562 


agacatgcac 


ttggggccct 


cctcaaagga 


5622 


ctgctcctgc 


ctgcagggcc 


agctaaagga 


5682 


gcctggctgg 


cattttagag 


acaagcaaaa 


5742 


ttcaagtttc 


catagagtga 


taagggaaat 


5802 


tggcaggact 


gggggcttca 


ggagaaaact 


5862 


gcttaaaaac 


actgagatcc 


tgggttttct 


5922 


aggagtgagc 


tagggtgaca 


aggggcaaca 


5982 


aggaaggatc 


ttttcttcta 


agataatcag 


6042 


cagttaggta 


tactaatggg 


gcaaaaggaa 


6102 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


6162 


nnattttgca 


ggtgagggaa 


ctagaactca 


6222 


tagttattag 


gtgacaccgg 


caagatttca 


6282 


gctaccatga 


tgatggtaat 


caaaaatggc 


6342 


taggtttttg 


cactgaagtc 


tgaatatgga 


6402 


cagaatcaaa 


aaatatgtgt 


aatgaattat 


6462 


atgtagcgct 


ttctcatttt 


cttcacgaat 


6522 


gataacatca 


gttcagacaa 


atattaaata 


6582 


gggatttttt 


taaaatcttg 


tcatttcatg 


6642 


ggagggtacc 


tgcaaaagaa 


aacaacaaaa 


6702 


ttacttctgg 


ggaggggagg 


aggtctggga 


6762 


tgtatattca 


ggtttttgtt 


ttttacaaga 


6822 


nnnnnnnnnn 


nnnnnnnnca 


agaaatttct 


6882 
catataaaaa tatgaaagta atcagactgc aacactcagt gcctgagaca gagctacagc 6942

tatcagggtg tccagacaga cagaagatta cattttcttc cttgctcctt gtacagcccc 7002 

agacctgcat gcttcattga aaagaaaaga agatacctga attaaatcaa tgtgatgctt 7062 

agtaccctat cagtgcacat ttcttttcnn nnnnnnnnnn nnnnnnnnnn cacttnnnnn 7122 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7182 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7242 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7302 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7362 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnca 7422 

cttatggcgg tattctcagt cattacaaat aaataaaaac aatccatatg ccctggagaa 7482

tttgattcca ggagtaggtc tagaagaact tcaactggag aatggataga gaaatcatgg 7542 

tatatttgca gnnnnnnnnn nnnnnnnnnn nnngatagca tgtgaataaa ttaattacaa 7602

aaacatatga ctacatctat tattatatag catgtagata aattacaaaa acatgtaact 7662 

acatctatga atcttagagc ataatattga gnnnnnnnnn nnnnnnnnnn nnnnnnggcc 7722 

agaagataac acatagcaca atgtcctttt cataaataaa tatattgctt aagcatacct 7782 

tatatataga agataaagct taaaaagtaa agaagag 7819 

<210> 15 

<211> 650 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (20) . . (175) 

<223> "N" can be A, C, T, or G 

<400> 15 

attatcactt atgagggtgn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 60 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 120 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnctgcc 180 
ctttattttt tcatgaaaga aattgctgaa gaggactaaa agaagtttta gtaagcattc 240 
aataaatgta tgttctttat agtttccaaa tcagcaaata tagacatcct gcatttttaa 300 
ggagatttat atattttatt ggacatgctg taatttattt aaccacttcc ctgttggtag 360 
acattatttc cattttcttc tgctagatta atgcttgaaa aaaatgtgtg cctcctaaag 420 
actgtgatga aagttgcctc tgaataaaac tcaaacaaat cattaatcat taactctttc 480
cttacttgta tgctctttgg atgctctact gtgttatcta taaaataaag tttgaagtga 540 
aaaattaggg taaaacattt tatatcattt ttaaaggata tatacatgga tgtacttaca 600 
tatgcatgtt taaatttata taccataaca tttatttctt tttttaaaaa 650 



<210> 16 

<211> 9179 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 

<222> (5211) . . (5516) 

<223> exon 2 

<220> 

<221> unsure 

<222> (1054) . . (1198) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (1297) . . (1430) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (1482) . . (1658) 

<223> "N" can be A, C, T, or G 



<220> 

<221> unsure 

<222> (3286) . . (3529) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (3580) . . (3620) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (4193) . . (4416) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (4506) . . (4682) 

<223> "N" can be A, C, T, or G



<220> 

<221> unsure 
<222> (6166) . . (6223)

<223> "N" can be A, C, T, or G

<220> 

<221> unsure 

<222> (6635) . . (6803) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (7946) . . (8351) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (8578) . . (8609) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (8870) . . (9178) 

<223> "N" can be A, C, T, or G 

<400> 16 



tatatttgtt ttttttctta catttttatt tcaaaatcta aggacatctt ataacccaga 60
aatatttttt ataccttgtc atgtcttaga ggaaagagcc accccagtct tttttcattg 120
atgtttttct tctctcttcg tactccagag gtagatgaaa accagagggc cacaatgacc 180
atggtgatgc ctgaggtcat tctggggcac agacctcagc ctaggttact ccacttcgcc 240
tatctttaga tccaaaacta ccctgctgac tgctgagata aacaaaggag aataatcagg 300
ttggggaaag gatttctatg cgaagacatg tctccatgca gtcctcctac actgagcaga 360
gcatgagtca ggtgcttaga gcaggatttt gtcctaaacc aggaacttca gagttttctg 420
aagaatgtgg ctatgtaaag caccccccca ccccaccctt acttctcaag tacattacgt 480
ggcaagtctg aaaaaactta cacttctgtt gttaaatgtg ggggataaaa tataaactta 540
gtttcaagag gaagctatct tgggaggtaa tgcaaataat tcgttgtgtg tttcctgaat 600
aagtgacagg tgctgactac cattgatgct tcattgcaat aaaatgcaaa gctcccccaa 660
gaatttttga aatgcatcaa gctaggtgtt ctaatctagc aaaaggacct gcatacatga 720
atttttcatg cttttgccaa gtcttttgcc ctttagttta gttaagggcc ccacatgaat 780
ggaaagcctg tgttgtcagc ttaattttgt agttgtggaa accttccagt tttctccttt 840
gtctaatacc ttcaggagtt caatcctagg ttgaagctta atttaataac catgtggcat 900
gtaaagtaga aaacaaaaca tcttttcctt agcatacagc aaaaaaaaaa aaaaaactca 960
ctcatggatg tagtgtacac atgccagtgg atatatagtc ataactgcag tcattggtag 1020
cacagaaata aatgtgcatt gaagacacag agannnnnnn nnnnnnnnnn nnnnnnnnnn 1080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnngg 1200 
gctagcccag tgtctgactt actgtgttta agaaatatca actattacgc tacttcccag 1260 
tgacagtcca aatgcagacc agtgttataa ctctacnnnn nnnnnnnnnn nnnnnnnnnn 1320 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1380 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aggcaaggtg 1440 
gtaaaagacg cttgcacgtg caaatgttac tttgtgtaac tnnnnnnnnn nnnnnnnnnn 1500 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1560 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1620 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnct tagtgaatct aatttgcagc 1680 
tgggcttgag aaaaaacctc ttcatagaat tgtttgcatc agtgtcttga ttgcctctgt 1740 
aacttacaat aagcaagaat gtttcaggat ttcaaaaatc tattgcattg cctaaacctc 1800 
ttattttgta tggagtaatc aagctcaaag tttgcatgtc ttagaaactt tacttggggc 1860 
aaaattagac caagtaacaa ttaatcttct aggtattctg agctattcag acatatgatt 1920 
catgtttgct aattgctctt ttctcttgta aatattagct gaaaaatgtc acctgtctga 1980 
caagtagcat attttatgcc tatcactcct ggcacgcatt cttacaaggc agacaggaaa 2040 
aataggaaga aaatggactt ttatcaaagg cccaggcagt aaagagggga gttctgctgt 2100 
aagctaaggg gagttccaga ggaagttata ggcgttccct ttcttatgac aagaaagcat 2160 
agtgcagtaa ataaatttgc taaatagatt caacagtctc tacccaaagt catctattta 2220 
attcttgttg ttatgcagac tcagcaacta accttccttg taagccccat tttcttccct 2280 
gtttcctgtt tatcaaatgt aattaaacaa gagaagtatt atagaagagt aaaagtagta 2340 
ggtaattctt gaacttggca tatgattact acatatttga tgaatagttg aatattattc 2400 
ttcaaggaca gattggattt ggtatcaggt ggctctgcat taagttataa gggacttaat 2460
aactcaagta tttaaggacg gcttccatca taaagggatc tgcccttaag agggtcccat 2520 
tatggagatt ctgaggtgag agctattcca agtgtgcagt ggattaaaat aaaagaatca 2580 
tacaggaaat ctctttttac atgccttatt ccagggtctt tgcaacctgg cacagcaagt 2640 
gcagatatga ttagcattgt tttacacatg tacactcacc ttatagccct gcccctgtgc 2700 
ccctcctgca caaaagaatg ctgggcacac gtgaactcct ctctgtagaa aggcacatta 2760
atgttctagc catggttaaa acagggatag aggcaagcca aaaatgtcgg tcatttgaaa 2820 
taaatctcaa gtttgtgcat atcactatca agtgtgctgt gtggcaatta agaatgccaa 2880 
tttgtgtgat cacaggcaag ttgcagtttg atgaaaggaa agcagaggtg aatatataac 2940 
cagggtcatc ctttctttct ccctctctct ctttctgtca tttatttgcc aagctcttaa 3000 
ctagaacttg ctatgtgcta ggtactggat atatcaaagc aaactcagcc tggtctttgc 3060 
cttcaaagat ttgcaggata gtgggaagaa aaacttgaat cagaggacat ctgcagtggg 3120 
aatcattcaa gcagcagaaa acccaaaagt tacttatact gtgaaatctg atcagagaat 3180 
ggactgtcct ggttagtaaa atatcctgga ggataaagat tggccatgca ttccacatat 3240 
gaattaccac tttcccaaga attaaaacat ggtacgaaag aaaggnnnnn nnnnnnnnnn 3300 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3360 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3420 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3480
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna aacaacttct 3540 
ggaatagcta atgcttagaa gcagctccca aatatttgtn nnnnnnnnnn nnnnnnnnnn 3600 
nnnnnnnnnn nnnnnnnnnn gcaagctcta ctgaacataa tttgatctaa tcttctgtga 3660 
ttattcagaa actacttcaa gattttccta tacctccatc ataatgaata cccattcatt 3720 
aatgatggaa gcagcctaat tttgtcattt ttcacacttt attgatgtaa cactaccttt 3780 
actagtttgg ccactcctta tgcttttttt atagaactat ttagatcaat tcaactttta 3840 
aaaaataaag ccacataccc ctgtggtaga tgaaaaacaa gtatcatttg cactggtaaa 3900 
tagagaatag gaagaaaaat aaatgcagtg aaaataaagc agtgttatca aatcctaccc 3960 
agatactgtt atctacccgg aagcttcctg tttgattaaa aggaaaaata gccagtgtta 4020 
gaggtgtgga agtctagttg aaattatatg caattgaagg attaaaatag aattgaaaag 4080 
ggaataaatt cctctctgaa taatttaact ccctttaggc tttgattctg cctcatctaa 4140 
aatcatctta catacttcta gtggcgtgtc cctcacattt tggtaaactc tgnnnnnnnn 4200 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4320 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4380 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnccat ttttcctttc cctttattgt 4440
cagaaaatag aaagcatcta cagtgggctt gtatgatgtg gtggttagaa atacctgatc 4500
tgattnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4560
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4620
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4680
nnaccatcac tggtgagagg aagtgatacc tggcacaaaa atatatggat taatcaatat 4740
ggattgaggg aaacaaacct ggagaatagg atgtgaaggt atttaagtaa catgagctca 4800
gaccttgatg gtagggaagt cgaaaggaag cattttgttc ttatatgaca gatgacctgg 4860
aatgactgca gggcttgggg ggtcagggac tggaggtggg agaggcctct gagagcaagc 4920
agtgctgtcc accagaagct cttgctgggg tgcccagaga ggagcaaagg gcagtcagct 4980
gcacaggagg gaatgtttgg aggagagagc cacctcagat cagcgggtca agaatcccac 5040
tcttgcccag atggatgggg caaaggagaa aaaggattcg ccacgggaat gtccagataa 5100
gacaggtgcc ttttggaaaa tgggggtgag atgggtctca ggttacactt cgtaagaact 5160
ggaatgtaaa gtaaaggcag acaatgacaa aatatcttgt tttcttttca gct tta 5216
ttc aca gtg aca gtc cct aag gaa ctg tac ata ata gag cat ggc agc 5264
aat gtg acc ctg gaa tgc aac ttt gac act gga agt cat gtg aac ctt 5312
gga gca ata aca gcc agt ttg caa aag gtg gaa aat gat aca tcc cca 5360
cac cgt gaa aga gcc act ttg ctg gag gag cag ctg ccc cta ggg aag 5408
gcc tcg ttc cac ata cct caa gtc caa gtg agg gac gaa gga cag tac 5456
caa tgc ata atc atc tat ggg gtc gcc tgg gac tac aag tac ctg act 5504
ctg aaa gtc aaa ggtgagtggt gtcaaggact agaatccatg gaagctctct 5556
ccaacagagg atctgcaagt cacagaaacc cattaaaggt agctcaagca aaaacaagca 5616
ggctgctttt aaggagacag ctatttcaga gaaaatgaaa gcatctgctc ggaaataatt 5676
tttgacatct gagtacaaag cagccgaagt acaagtgaaa gggggtagga cctataggaa 5736
taaaatggga ctggaggaag ccaggaaaat tagtccctga aatgtgggag ggtatgaaaa 5796 
ataagctttg cctaattcac aattctccca tggaacatcc ctgacttgat tattaagata 5856 
ctctttttca atagtttata ccctgaatcc agagttttta aaaccatggt ttgccgccca 5916 
ttcatggatt aaaatatcaa tttagtgagt agcaaccaga tgcacgtttc ccgcccttta 5976 
aaaaataatg tatagaagag aatagacaga gtagatcaga cgatatcaca gagtaggact 6036 
gagtactgta aaactaattt ctgagggacg tgtgtgtgtg tgtgcgtgtt gggtcatggt 6096
ataaattttt tttttcttac tttggatcat aaaaagttac aagtttggaa aacactgctc 6156 
aaatgcaagn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6216 
nnnnnnnaga gtcccatgaa gtctatagct gtccctattc ctctggattc agggatctct 6276
ccactccagc acaattgaaa atctaaatat aaagagaatc ttcacactct tgtttgttct 6336 
agaaaaggtg atttgaggaa agacatataa caactataaa aaatagattt tgcttgttca 6396 
ttggcttatg gtctccaggc ttgaatgctc tgagataaat gatgccaata tttctctggc 6456 
ctcttcccat cccacgcatt ggacctcaga tggtctgtac tgtcttctag agggtttgtg 6516 
ggttttggcc ccaaaaaacc attaaccttg gcagaaagtg tgtgacttta tgatctggta 6576 
caaagaagga caaactagag ggactggaca tgaggatgaa tattgtgttc gcccttatnn 6636 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6696 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6756
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaaa gaccctcagt 6816
tgttacaggg gcagtgacct cctcacacct caaccatcaa tgagtcacca ggaaagccat 6876
tagcctagat gtaactgttt tctatcttta ttgcatttcc tacatccagg cagcagctgg 6936 
gaggaactct agaacactga agtttgtctg agttccctta atgtaaggct gtacattctc 6996 
aggatgcctt gatgtactcg aatatctgca accctaaatc accacctctg tttttattga 7056 
tctctatctg aatgctgtat taatgggcca ggccttctgc ccattctctc aaactgagaa 7116 
ctgtctctca ttcctgggga ggcaccctgc ctactcctta cctagatcag ggatttctca 7176 
gttgtggaga gatttgttcc ttatagtgtt ggtcatcaaa ctgggatatt tggggattac 7236 
aaagactttt caagggatgt atgggcacag gcagttttag gaagtgagtt cctagatcct 7296 
catcttcccc aaatactcgt tcccaaaatt gacgagcctg acaatgtgca tgccaggcaa 7356 
ggctcttggg gttcccctaa aacacttcct cttttaagcc taccactcac tcatcatgaa 7416 
tatagtccat tgtcccaggg tgtaaaaccc tctatagtgt taaataaaag aatgattggg 7476
aacattgaca cctgatggaa ctgttatgac taaaaaccct tttgcaaata atgtggtatc 7536 
taattttctg ctttcaacaa aattgaagga ggcccttata aagttaataa ctgataatca 7596 
aaaatgagta atttttgcca tgtaaatcag gtcaaagaat gaaatggcat tgctgtaacg 7656 
aaactgcttc cattcccatt gatttactca tacgaacaag attccttagc ctttataagc 7716 
tacaaaaaaa tgaaaaatag aaatagaatt gaggctgaat tctattatat aaaatcattc 7776 
caaccatgtc atatggttct tcggattcat gaataatttg gaaaagagag ccatatccat 7836
cttattaagg gacacattcc caataaattt tcatctttca tgtttaataa ttatcaatat 7896
tcataacatt ttacattttg atcaaatatg tgttaataat aatagaaatn nnnnnnnnnn 7956
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8016
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8076
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8136 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8196 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8256 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8316 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnattag atcttaatgc agaacaccct 8376
gaacatttaa agcttcatag tcacaagaga aaagttttca tttcaatagc tataaatatt 8436
ttgttgttgt aaagacatat aacgataatc aatacaaaat ctgtcaaaca aaaatatgtt 8496
acattaagat aaaattctgt agggaaggtg aaattggaag tgagtttcaa tgaatgaaaa 8556
gaaacaattt agacagagaa gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnncttagaa 8616
ggaattgaat agattaggtt ccttacccaa aaagcctctg ttatttgtct tatttattta 8676
ttctcttttt tccacattct ccagtctcat tccccttttt taacacagga aattattcca 8736 
gcatgtttca tacatattct tttgtttgta agagcttatt taaaatatgt aatattgttt 8796 
tagatgcata tatttttttt cttgtggaaa ctatattgta ctatatatat atattttaga 8856 
aatggacaca ttannnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8916
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8976
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9036 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9096 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9156 
nnnnnnnnnn nnnnnnnnnn nnt 9179



<210> 17 

<211> 1814 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 
<222> (738) . . (1010)
<223> exon 3 

<220> 

<221> unsure 

<222> (72) . . (118) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (405) . . (630) 

<223> "N" can be A, C, T, or G 

<400> 17

tagatctcag ctttcttgag gcagggagcc atatctgttt aattcactca gcatatactg 60 

caaagaagca gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnta 120 

gggagaagtg cagaataaat atacccaact ctttactatg tatagacatt atctaggtct 180 

ttattttttt tctcttctta atctcaaaga aaacagagga aaggaggaag taaaaagtaa 240 

atttttgcct gaagatgttt ggaaaaaata ccaaataaag tgagatagtg ggtaatctag 300 

tgatttttat ttttccgtcc tctttctggc ctccaattgt gaaataattt atagcactgt 360 

aagaaagaag ccacaaattg tggtagcttg gaccactgtt gaggnnnnnn nnnnnnnnnn 420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 540

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn acttaatatt gatagtgata attttattca 660 

tttcacatgg catgaagtac caagctctat aggaatcaga aaataaagtc ttatttcttt 720 



ttcttctcta ttgtcca gct tcc tac agg aaa ata aac act cac atc cta 770
aag gtt cca gaa aca gat gag gta gag ctc acc tgc cag gct aca ggt 818
tat cct ctg gca gaa gta tcc tgg cca aac gtc agc gtt cct gcc aac 866
acc agc cac tcc agg acc cct gaa ggc ctc tac cag gtc acc agt gtt 914
ctg cgc cta aag cca ccc cct ggc aga aac ttc agc tgt gtg ttc tgg 962
aat act cac gtg agg gaa ctt act ttg gcc agc att gac ctt caa agt 1010
aagagctgcc cccacttcct aggtctatca gttagggttc agacaagaaa cagatggcat 1070
actcgagtga tttgaggaga gtgtaataaa gggactgttt acaaaggtgt gatcaccatt 1130
tggagaaact acaaaggata gtgcagaaca ctggggcttc aatgttggga gggcaattac 1190
cactgttgga gaagttactg gaatcagaag ggagctgtag ggaaagcccc acttcccagg 1250 

agctgtagcc acagaatagg gaagctgcca catgcagcga ctccaaaggg tggaaactgg 1310 
atgaatgaat accccaactc attctcctcc caccctccaa tctcctgcta gcacctccca 1370 
ttggctgaac ccagctagaa gtcagagaat acaagggtcc actgttgtat tccataaaag 1430 
tcaacttctc agggctcaga gcaatattga catgtacaga atagatctgg agaggaaaca 1490 
gaaaatatct agtacaatag ctaatcactg tgattcatgc acagtgtcat gagccagcag 1550 
gatgaatatt cctttgctgt acttgctgcc agtcagctgg ttatgggttt ttccaagaaa 1610 
tttggtctct aacaaaattc ttcagagcct ttactgacta tgctggatat ttttggaagg 1670 
gatcccatac ttttgaactt catacagcag aatttcaaac aatcttggga aaataacaac 1730 
ttttatctgc ccagtaagga caactaacac ctagtatcat aatcatttcg taagagacag 1790
gtaatttcat caccgagtgc atat 1814 



<210> 18 

<211> 19217 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 

<222> (6412) . . (6543)

<223> exon 4 

<220> 

<221> exon 

<222> (11953) . . (12003) 
<223> exon 5 

<220> 

<221> exon 

<222> (18746) . . (18751) 

<223> coding portion of exon 6 

<220> 

<221> unsure 

<222> (722) . . (1277) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (1740) . . (1924) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (3585) . . (3783) 

<223> "N" can be A, C, T, or G 
<220>

<221> unsure 

<222> (3929) . . (4224) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (5233) . . (5352) 

<223> "N" can be A, C, T, or G

<220> 

<221> unsure 

<222> (5974) . . (6030) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (6048) . . (6073) 

<223> "N" can be A, C, T, or G

<220> 

<221> unsure 

<222> (6091) . . (6306) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (7576) . . (7688) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (8259) . . (8368) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (9787) . . (9885)

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (9926) . . (10110) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (10113) . . (10241) 

<223> "N" can be A, C, T, or G 

<220> 

<221> unsure 

<222> (11000) . . (11438) 

<223> "N" can be A, C, T, or G

<220> 

<221> unsure 
<222> (12528) . . (12844)
<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (13720) . . (13963)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (13983) . . (14275)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (14529) . . (14551)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (14585) . . (14621)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (14703) . . (14749)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (16380) . . (16436)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (16484) . . (16517)

<223> "N" can be A, C, T, or G

<220>

<221> unsure

<222> (17022) . . (17366)

<223> "N" can be A, C, T, or G

<400> 18

ctactgagaa gggatatact ctcagactaa aggacagtcc ctagtactga ttcaatctgg 60
ctttatagaa aattcactat attgtcattg tatttcacag tttgcccttt gtcttagctg 120
gtaagacaga gcctatgata aggacttgtg tggcatgcag gtatttaatt ggcaacccca 180
gagggcagaa gcaagagatt taggagttta agagagggta atataagagt atattatcaa 240
agttgtagtg tggacaacag aaactcaaat attcaaggac cagcatgtag acagcctcct 300
aagatgtcta ctcagacaaa gaatttcagg tggaaggact tgttcatctg cttcacgccc 360
attggttgac aggaatatga actccattct gctgctgggc tagacatgca tgtgggctga 420
gtgagctttc cccagtatcc gtagcatcag aaaagtcgca gggcagaaag aaaagtatcc 480
aatttgaggt gaattactga ccttgaagtg agtgtaagcc taactagaat tctaccccag 540
ctggctgaag tgaaaggtga ggctgagagg aaataaggca ggactgcaca gtccccaatt 600
gtactgttca aatccactca tgcccttcat taagtcagct ctgccactga gccttccagc 660
tgggaggcag ccacaatctc tgcagaagat ttaatataca ccagtttgtg gaacaagctg 720
tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 780
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 840
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 900
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 960
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1020
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1260
nnnnnnnnnn nnnnnnnacc tgcagcactg ggatagccct ggtacagacg tagaacatgc 1320
ttgaggaggt tgtcaggagg aaatgagtta gaccttgcac agaactacca ccatcaagta 1380
cagttggggt gaggcagaca tggtatgagt tgaggcatca gagatatctg atgctttatg 1440
ccaaattaaa attaattttt tcatggagtg acactgatcc acagaccaga ctccaagaac 1500
tttgcagtga ctaaataccc atctcatcat aactttcctg gtattttctt ctggaaaaaa 1560
ttcttccctg atacagtttt cagaggcagc tagatgcact gtcatctctc cccttttccc 1620
acttccctac ctatccacaa tttactaccc aatgccaaca ctaaagttag cccaacttcc 1680
ttctaactaa attattagtt tagaaggaaa gagaggagtc atgctaagga tcttaactgn 1740
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1800
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1860
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1920
nnnnggggag aggaggggag aggaggtagg gaggggaaag aaagaaacta gaaatccatc 1980
aattttagga ccaacttcag gtaaaaaaat gaattaggca agttggtctt tcaacattct 2040
ctacctctct ttatatcatg gttgagacca cagacttctc acctcatgaa agatgaactc 2100
taactaattc atactaaagc taaagcctct aaagaggatt aaatatgagc aatcccacga 2160
gaactttttt cccctggaat tgtttattca actgtcgttc gttatatgga atttcctgcc 2220
tggttaagtg taggccagta ctttggatga attgtagttt tctagaaaga cgcttcttat 2280
ataagaacct ctccagggaa acaggggcct gtatgagatg aattgagaaa taactttaca 2340
ccactgatta tgtcagtgtt ctattctgca tggtagagat gtgaaagggc agactgacca 2400
ttgctctgga agcctttacg ctgtgagaag ttaacagtgg agtaaaatgg ccactccact 2460
ctcttcatgg aagccaacat ggcttactaa atagtcaaca accatgggag agacctgtgg 2520
ggtcttcatc agagctcagg atctcctagg gtatcactca taaatacagc catcagggag 2580
atggagaaat ctttgtgcag ccagaaattc tcaacctggt tttacccatc cttcccaact 2640
ttgtattcgt cctactgttt actgacatgg atcctctgct tcattaacca tcccttcctc 2700
accacatgct ctctgaactt ggctgcacct tttctacctc catgccttct ttgctcaggt 2760
ttttccacat aaatatcatt atttccctct ctactagctc caagcccacc ctctctctgg 2820
ggcagctcag tcactccagg gcacaagggg gtctttccct catcccacat tttgagacct 2880
actacctgga ccatttgttt gccttgtaac tatgcttgcc tttttaattg ctattttatt 2940
ttccatgtat tttcattgtt cacacaagtc ttctttattc cacactaagg caaaagcaga 3000
gtcctgtgtt cataataagt gctcaacaaa tgttgggttg attgggttgg agattccatc 3060
ttagataatc gcagtcccat catgccagct accagactgt gtggacagcc aggtcagagc 3120
agccaaatga tattctagct tgtggcacaa ataccagcaa caaaataacc aaagtcacac 3180
atctgcctct gagttcctgg cttctatttc tcaagggcat ttttaagttg tcttatgact 3240
gttccctttc tactcattct cataaattga gctgtggact gctgtgaccc acaagcttct 3300
ccggaagtca atgtataaaa caaacacgga aacgaagagt atggtgggtg gagggtactc 3360
cactgactct agaatggatg actgaacatt ccaaatttca agcacaagtt agggagcaac 3420
agatcatttt ccttttgaaa tagggtttct tctgctcagc cagttgttgt attttcatta 3480
ggaaatggaa tgggactaca gcacaaaaaa taaatataaa aggacccttg tagggctggc 3540
agaaaagaga atccttccta ggagacctgg aggtgattcc aggcnnnnnn nnnnnnnnnn 3600
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3660
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3720
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3780
nnntcagaaa gtgtgcaaac agtaaaaaaa aatggtatat ctagcaagtt gcatgcctta 3840
cttgtgagtt catgaagttg tggcaaggat aagacaaata ttttttgcca ttgcatcatt 3900
atatcattgc taagagtatg ccattattnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3960
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4020
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4140
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4200
nnnnnnnnnn nnnnnnnnnn nnnnttaatt aattaaatta aagaaacgac aaaagagtat 4260
gcaagaattt taaaacaact tagaggaata tgtatgagga tacaggctaa gctaccataa 4320
tgaagagacc tcgaaataca gtgagaagcg agacagaagt atctttcgtt ccatgtaaca 4380
ctcaggtggt tcagagcagc taagcagcta tgttccatag agtcattcag tgatccagat 4440
tattttcatc tgttgctctg ccattctcca ggatgttgtc cctataaaat tgtcaaagct 4500
cagtcagtgc caaacccatg tttcaacctt cagaaagtaa acgagtggtg gaaaacacat 4560
tcaatgtttt aaggccaaga ccttgaaaac tcactctctt agcctgaact tagattacat 4620
ggctgggccc acttaactat aggggaggct tggaaacata gtctctgaga agccatgtgt 4680
ccagctaatt ccctaatact aaagttgaaa gaaagaatgg attaaccagc agtataccac 4740
aaggtaacaa atgactagga ggatcaggct aggtggacta gaaaagagac agtcaattca 4800
gtgcaacaat tccatattga cacttttcat gtagctgttg cttggctcta tctagagagg 4860
actcagaggt agtttagata aggcctttgc cctccaaata cagtctaagc agactgattt 4920
cctactggat gttcaacttt ggagtcttca gggatgagta gggcttctgt acgtggaaga 4980
gactatgagg gaacctgcac aggacaaggg tttgcataaa gacactgagg tagggacctc 5040
tcctgttgtg gggacagtga gaggcccagg tctccttgac tcacaaagtg cttactaagc 5100
acttactaga aattaagaag cagattataa tcaatatggg ttatccaatg tttggatgag 5160
caaggctcct tatcttttct tcgttaatgt taatcacact cttttggatg gagacaaata 5220
tctgtggggg ctnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5280
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5340
nnnnnnnnnn nnggctcaga ctaaataatg tctaatctct tctccagtaa aacaatccgt 5400
ggttctcaga tagcactgtg ctggaggtag tggggtttga gggctgggaa gttgggagga 5460
ctgagccctt cccgctgagc agtttcgtcc agtttttcct gtaccagcct gtcatgttta 5520
ttccatgtga atgactccag aggcaaaatt caagcttttg aatagggcac aaattaactt 5580
gagtaccctt tcatttccct gtaggtgaac actcctctag ccctgccttt tgtcagtctg 5640
gagcccttgt tctaatctgt acacaccaga ggactttaca aggctttccc cagcctccag 5700
aattattctt ctgatccacc ctctactaaa ctcacccttt cctcagtgct aggacgttga 5760
aaaaccgaaa caaggcaaag ggccaattgt aataattcac actaaggcat gagtgactag 5820
gtttagtata ttaacactac ctaggatatt ctatttcttc caaaaggatc ctgttaatcc 5880
ttgaaattta acaactaatg gtatagattc taagcactgt gagtacttgt cagtggggga 5940
aagacatttt tgggctgaga gactttgcca ctgnnnnnnn nnnnnnnnnn nnnnnnnnnn 6000
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aagaaagatg attagatnnn nnnnnnnnnn 6060
nnnnnnnnnn nnnaaaaaat aacatgagag nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6120
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6180
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6240
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6300
nnnnnncaca gagctagccg tgttggctgt cacccactca tgtggccagc ctgttggtct 6360
acctcttagt tgccatgtaa caggattctg gtgcttttcc tttgcccagg t cag atg 6417
gaa ccc agg acc cat cca act tgg ctg ctt cac att ttc atc ccc tcc 6465
tgc atc att gct ttc att ttc ata gcc aca gtg ata gcc cta aga aaa 6513
caa ctc tgt caa aag ctg tat tct tca aaa ggtaagtgag ttttattcat 6563
ggtaacccaa tgcactgggt gtctgcagca tgagccactg ctttgcactg caggcctatg 6623
gcttgctgct ttcatgctaa acccactcag agcttatgaa ccactttgag cttgtcttga 6683
tgattatttt tccccagaag aaaatggctc tcatcgtcag tgagctgaac ttcttacact 6743
gagtttttta aagggaatgt tttgttctta tgtctgaaag agtttgtctt attctttgag 6803
ccaagagctt tcatcagcct catgagagtg atgttatttt ggcaatgcag agagctacgt 6863
gctccgattt tgctggtggg aggttgccag gatcctttct gaggattcct tccattttca 6923
cccctctttt ccccagtctg gatatgacct gggttaaacc caccccctct cccaggaatc 6983
tcaacctcac ggttgggtaa ggaaaggaga aaggtttgtg aggccatttg gggataagga 7043
aacagctggt tggtggtgca ttaacgtctt tcagcagctc ccttcgagtt tctccttagc 7103
ctgttgtatt cttaccaaca cactcctgtt ctgttgtacc agctgggaca gagcatgctg 7163
aagcctttca gccctgattt cattgcttca ttgttcatgt gtctgtcttt ggtttcctgg 7223



gtggagcctg cccacaaaac ccccagaatg tatgcaggcc tagctggtgc tttcctaaac 7283
ggctcccttg tctgcactca atgaacttct ccaaagatct atacatggcc tcatctatag 7343
aaagagaaat gacatgtgga aataattcag taggagtttg cagcagcact atctgaggac 7403
taggggaatt ttaagtggtt gttatcttac atttatactc ataacttcta tattttcatc 7463
tgccataaaa tattgtcatg ttctatttgt ccattgccct atgtgtgtat gtattcactt 7523
gggtgctgac cacaatattt ctaactgtag aatgcaagga attgttgcca aannnnnnnn 7583
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7643
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnncttca tggtaaagga 7703
ggaggtatgt ccaacagaac ttcgactttt aaatagaacc acttcagaga gttgtgtcag 7763
gtgcacctca gttgtcttat cttctgccat tcttctttta cctctcacac ccatacctca 7823
gggttcaagg cctggggcct gaggactcct taataacttc agaaatgagc agctgagtgt 7883
tccgttccag ctgtctttgg gagaatggaa tggagtcaca ctcaaagata gagtggaaat 7943
aaatcctctc ctcatccttc accccaatct taagagtgag tgaggatatc agtagctccg 8003
agctgggagg taaagctcaa gttctaactg tgattaggag acctttctta caaataagaa 8063
ttaagtgaat aaatgtgcaa acaatttctt ttatattttt aatgaaccag agagaaatca 8123
tggttgccta tataaccctt gtctccaact cacttgcatt cagatctgct ttcttacatg 8183
tgtctgccat gcacacaaac ttgtgtgcca tggaaaaggg ttgagaactg ctggtgatgc 8243
agacagagct ttaaannnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8303
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8363
nnnnngcagc aagagaagga acattttaca gcttattggc cgaacttcac tgccgctagt 8423
gtggttcaac ttggactaca gagaaatctt cctaactggt ttccctgtat tcactcctgc 8483
tacctccaac ttggtctgtt ctcacttttt gctataatag gcttttaaaa atcataaatc 8543
taccatgtgt cctctgtcca gaccttctcc atggcttctt attgctcact ggatgaagtt 8603
ccaacgagcc caggatggtt tgactcatgt ctccagcttt aactgcatca ccatcacctt 8663
cattgtctaa agctctaacc acacaggatt ttctagtcct cagaggcatg gcagtctttc 8723
aattccgagt tttctcatac aatattgtct cttcttaaaa tattttttct tgttgtccac 8783
ctgagttgga gtcatctttt aaatctcagc taagcttata cttcatcaag tctttcctaa 8843
ttctacctcc acgcaccaca cccattacat taaatcccct tattatatgt ttccatagca 8903






cctactttct tcttttcagt atactcagca cacaatcaca tgtctaggat ctgttttaat 8963 
agcttggact accaattaaa ttgcatccct tttaattgtc cattgattcc tcaagtaccc 9023 
acatgcccat cttagcaaga agttcagtgt ctccctctta tagcatgtac ttctccacct 9083 
cccacaaact gccagaaagc ttacttagcc cacagggcca gtgctaggca gctaggttag 9143 
tcctccagag ggccctggtt ttgagcagtt gctgtctact ccggccatgc agaatctctg 9203 
gtccttccag atgtctccat ccactgtgca aaggtaacct tgctggttcc gatccccaca 9263 
cagaccacag tgctacaaga ttacagttct tatggttccc caacacatgc tctgtcattg 9323 
gtcccaaagc aggaccccta tgggttgatg aggtaggagg aggtccctgc cttagccaca 9383 
gctgcacaca gccagcctct tcccttctag gccctcatgt tgagcctggg acgccagtcc 9443 
taacttcctt ctcttcagtt cctcttaggg ccattggtat cctgaatttc ttagtccatt 9503 
gcaaagttaa gtaaagaagc agcaggcttg gtccctttcc ttccagatgg cttcttagct 9563 
cctgaacaga tttacccacc tatacctcag tgactagctc tgtgtactaa agtgtattgg 9623 
gagggcagcc attattggtc cataaaaggt cctgcttacc attttcccct aagaggaacc 9683 
attcaacagt ttggggctcg agggtgacct gctgggctct agagaagaag ctggcaactt 9743 
ctgttgcaaa ataatgttaa attctgcttc atctgcttgt cttnnnnnnn nnnnnnnnnn 9803 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9863
nnnnnnnnnn nnnnnnnnnn nntaaacatt gaacctacta tatgcaggtg agtatgctag 9923 
atnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9983 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10043 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10103 
nnnnnnntgn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10163 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10223 
nnnnnnnnnn nnnnnnnnga tttgaataca ggtctgtttg actccaaaac ttgtggccta 10283 
tttgttgcaa aagtgcttaa tacaaattgg ttcagtcaat attattatct ttgaacaatg 10343 
gaaggagaaa gtaagtttca atccaaaata attgagtgac ttatacattg acttgctgag 10403 
ccaatggcaa agtcaagtta gaatccagca gaagtcacca gctacagaat ctagatcttt 10463
agaacatgtc ttcagatctt cagaacagtg tttcttaaac tctcttgtga aggaacagtt 10523 
atcatcatag gctggtaaca gttcacctac cagcaccagc ccatgaacca gactctaagt 10583 
ggcacagccc tagaagattg agccagaatt ttacagaggt ttaaagacca aatatgctgg 10643 
tttatggtta cctgtggccc acagagaatg gcagcactaa cctcaggcat aaatgaggta 10703
cccactgaag ccaacattca agagcaattc ctatgggtta accattgggc tcctttcaaa 10763
tgcaaaccct catgaaagag actacagtgc tgaatagaga cctccaaatt ccaggccaag 10823
ctcaggatag tcatgaggga attactaaaa acctggtata tagggcaaaa gcagaattag 10883
gaatggactg atttcaggaa cccaggcaat ggcaggagtt gggcattaaa tcctaaaaga 10943
gaatcagagt gggagggaat atgtgaaatc agaggttaag aaaaaagtga aaacctnnnn 11003
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11063
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11123
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11183
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11243
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11303
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11363
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11423
nnnnnnnnnn nnnnnaaaaa attaaaagaa agatgtgaaa tcaaggaaac ttactggtga 11483
gcagcatccc attatgtgaa cttgtgcttc tgaaccagta acttgagtta ctttgagcca 11543
gtatcagtca cttatacctt agtgcaaaat taattgatca gacattctga cctggaccag 11603
ggaaggcagg cagaagtagc agtcaagact aaagcagaaa agggagagct aattctgcag 11663
ccagacattt cctggattga atacccaaat tagtccctca gcctttaagt gcctgagggc 11723
C a rrrr Pi rrt" arra ttcacactct ttgcctaggg 11783
gccagatttt gctttatgca ttaccatccg aagtcccagg ccacagtgaa catttgggct 11843
tcgctatgtg gatttattta gatttacttt ttgtcctgcc atattttaat ctataagcca 11903
aacagttttc tcattaatct tattccattt ctggaatttt tccttttca gac aca aca 11961
aaa aga cct gtc acc aca aca aag agg gaa gtg aac agt gct 12003
gtgagtaagc atgattttta cttttctttc ttactttctt ttctctctca gcttgaattt 12063 
taaagtaacc actgttctat taattcatgg aaggcaactg aatagttcca gcttatagaa 12123 
tcttcctgtt tggtagcatt tcagcgaagc ctcgttctta gccccagaac aatcatgcca 12183
tcttttgctc ggtctatatt cctaagcact cctagatgat actgcactgg acctctggtc 12243
tcacatagtt agaaacagag ttaaaatcga acagcaaaga gaagatattc aactgcgatg 12303
caattgacaa tggatgtttt tgcaacaaac aatgattaag aagtacattg ttgtgggctc 12363
tgagtcaaga gtaatatggg aaaaacacaa gtctcttcat gaggttgaca ggtttggagc 12423
tggaatctgt ggaggaggaa ggatatgatc taggggtcag aagaagtggg ttactaaaat 12483
cattaagcct ggttggatga aaagcttaga ctcaggggaa gcagnnnnnn nnnnnnnnnn 12543
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12603
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12663
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12723
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12783
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12843
nggcaccaag agatgagggg ggcagtcttg gccatatatt tggctgaagt aagtcaattt 12903
gtcattcctg catgagcctt tataaacaga agtaagtaac caactactat ttggtcattg 12963
gagttgtcca agaggccagg gttctgtcta atacctgttc atgcatgaac atgccaacct 13023
agattgcatg cagactacca gttttgggtt tttgtttagt tcagcaggat ttttctcagc 13083
tcactgcctc tcaaactttc agcaacaaaa ggacatctgt gatatcagaa tctaccactc 13143
taagtatttg gatgcaatag caatgaatat ctgagtaaat ctaggtgggg agtgggggca 13203
ccctgtagcc aaaatgattt aacaaaatca aaccaaaatt ttggaaatga tgccttggta 13263
caatgaagag actacttgag gtaggtttga cttatctaat atcttatttt ctttaccaat 13323
acctaatgag gaatttaaat atttctagat agctttggaa aggtccctta aagaggcacc 13383
agcataccac tgccagatct aatcccccca aacactgttt tcatcatcat catgtcatct 13443
cttgtctcta tagatcatat caaatccttc ccagagtttt tcaggccttt tgacaactag 13503
ccacatttca ctaagccaac tcatctacca ctcttcaaca aaacttttcc tcaagttgag 13563
ctgctccacc aacaccactg ccatgagctc attcccactt ctgtggcttt gctcatgttg 13623
gttatttttt tggagtgtcc tccctattcc ttcttacttg tcccaatccc aacttttggc 13683
atggtctact ttaagataca gtaatgagta actttannnn nnnnnnnnnn nnnnnnnnnn 13743
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13803
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13863
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13923
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tttttaaaat ttaattcaan 13983
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14043
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14103
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14163
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14223
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nntacataca 14283 
caacactctc ttgcttaatc tgtaagactc tctcccccac tcataccttt ttatttttcc 14343 
tctgcattgt acacacaatc tataccactc ttaagcacat gattacagcg ttattttctg 14403 
gctgcttcta tgtgtctata ttttaggtcc acctggtcaa tataataaag tgggatatta 14463 
gtgttaatgc aactatatgg tatttgatat ttgtctttct gtccgtttat caatgtttct 14523 
tatagnnnnn nnnnnnnnnn nnnnnnnngg tgtctgattt tgaccaaatt tgactaaata 14583 
cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnngt tgagtctatc aactcaaaaa 14643
agaataacct acaacaataa caagtttcag aacatttttt aaattactga ttttatgagn 14703 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngaat ccattgccta 14763 
gaaattgcca tggttaagat tttaatattg ctcaggccca gacagctcag ggctttgaca 14823 
ttcccacacc cattctctgc catcccagtt ctatctcatc ccaaaaccat ccattatgag 14883 
gagagtgtac agctctaggc tgcccgggag ccatcccgca ctctcatttt gtgactcggc 14943 
atcttgggag atggagtctt gggacttagc ctggacatgt cccttgcatg tacttcttac 15003 
aagactttta ttcagatgaa tattttccct tccaacttaa gaagcacagg gcttgctggt 15063 
tttgcttcac taaccagcaa ctgaagcaag acctgacttg tgaaaatgcc taatagagtt 15123 
cagtattagc gctgtttcac catttcctgg atctcttgcc tttgtgcaca tgatagaatt 15183 
gcacttctct gtgattaatt tgtagttaag tgtggtcatg taactcgctt tggtcaatta 15243 
aatgtaagca taagtgatgc gtgttatttc tgggtagaag atgtaagagt tggcatatgc 15303 
tttgccatat tttctttatc catctggcat ggtaaccagt aacattctag gtagtaattg 15363 
ctccatcagt ctcagtctct gagtgactaa aattgacaga gtcccctgct gaccctcaat 15423 
gtacatggaa catgaacaag aataagcttt tgttttttat attgagattt tggagttgtt 15483 
tgttcctaca gcattaccta gtttactcta atacaacatg gaaaaaactg gaacctataa 15543 
taaatagacc ctacgttgcc atttaaactt ctagttctga ggaataataa tgtggggaaa 15603 
tactttctat ataataaaaa aatagaaaat tgcaaaataa aaatatactt atgtatcatt 15663 
catgtcctat taaaaatgtt atttatagac tcaccatatt cccttcctcc agaaaaatag 15723 
aagtaaaaat atgaaaatgc ctgtaatcat gtttttggat tatggaatca agtattgctt 15783
tttactttta tgttttctga atttttgttg tacttcacta catttttgag tgccctgatg 15843
tattactttc aaaaagaaga agaatacttt ctgaagccat ttcaaccatc cccactcacc 15903
tctctagatc ccagtaacca aatacattat ataggactct tcatcagtcc ttatcaagtt 15963
taggaagggc gatgctatac cttctttaaa ggacacctac caatgtctta gttgcctttc 16023
aaagactcct agcacagcta aatgtgatgg atatgctcta aggatataag agctgaagtg 16083
acttgcataa ggtcatatca taacttactg ttagaaatgg agctagaact cagacccact 16143
gagtccttgt ctgtgacaca ctgccctttc catttgtgga agttgttctt gtatctaact 16203
ttatctgtgc tactatttgg gcctagccat tctccctctt atgcagacaa gcagataaac 16263
agtaaaactt taggagtgga ttatgatacc atagatatat atcatctatc ctttacaaaa 16323
tagttattac agtcatcaag ccttggttag agtttacaga ccatgtatcc tagctannnn 16383
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnntgaaaca 16443
gtagcagacc aaaagaagtc atgattccca gcatagtgct nnnnnnnnnn nnnnnnnnnn 16503
nnnnnnnnnn nnnnaaacct gtattaagtt tctgttattt gcaagaccct gtcagttagg 16563
ccatgtggga actagaagga tgaatttatc agtcatccaa gattcttaca attaagtatt 16623
accgataagg tactcaagaa acagttctca ttcacataat ttgggttaaa acaaaaagaa 16683
gccagctttc tatatacttt tggtccagtc tttacgtttt ttgttttgtt ttgtttgttt 16743
tcatgagtat cccgacttcc ttctaagaac ttccacctga gaactgacca cagcgtcagc 16803
attccacatg ggtgtgtttc ctttcccctt tcccatttca gtggtttcca atttcttttt 16863
cttttggcac tataaacctt tcgcaaagga aatattagac agaactccta catgtcaagc 16923
aaattaaaat agtggtgaaa ttagagtgga ggacataatc accctatcat ataggctatt 16983
tgtccatatc atatttgtcc ctacaaaggc ctctaaggnn nnnnnnnnnn nnnnnnnnnn 17043
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17103
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17163
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17223
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17283
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17343
nnnnnnnnnn nnnnnnnnnn nnnctctaag ataccttgtg cttcttggaa catatttgga 17403
aaatcatgta gctctcaaat tatccctatg tcctgaagcc caccttacca tcaatcctca 17463
gaaataccca acccatgtca ggcaacttca cactttcttt cttcaggcag cacagttgtc 17523
tcagggaggg aggagagtgc tattagcaag aggagtcact aacagcttca ctcacctgtg 17583 
cccatgaatt tttaatggtg tgaaaagtct gtgtattttt tatggttctc tatggctcat 17643 
ataagaggga caaacaatac atgaaagttc agagatgggg acaaatatca ctttaagctg 17703 
gggataatca gggaagaaga aagttttcat ggagaaggtg acttttgaat caggaagaaa 17763
tggaaacagc aaactcttct agatagagga gacactaaca ggaaaggcag agaggcagga 17823 
aggtgtggga aagtgcacgt gaccccgttc agagagaaag ccaggtgtga gataaggggg 17883 
aaagactgtt agggcatgta ttgtaaacca ctaattccag gcaaaagtta gattttactt 17943
actaagcaag agtgcttcag ttagatccta gcaggaaatg gagggtatgc ttagaagagg 18003 
taactgaggc aagtttaatt tataaaggtg tgtgcagcat taagggaaac cagcaaggga 18063 
tactgagcat gccaggatgc aagagcaggc agggaaggtg actattccta ggtctgaagg 18123 
agaaagggga gggagcagtt cccagaaccc tagtaaaaat ggcaatgaga aaggtccatc 18183 
tggcaggacc tatggtcttt aacagaggga caaagtcaac ccacaacttg tctgggaggt 18243 
tgctgaggaa tagatacccc aacctctctc tcaacccact gcaacactct ttttccccta 18303 
gactgagccc agtcaaagac agagggagga gcccagtgat gcagtctgca atgtcatcat 18363 
cctggagcat gaatagagtg cagcagggtg aataatgagt ctgcaggaat taatagaaat 18423 
atctgacaca atagggaact ataagaggtt ttgaatagga gaggcccctg aaatgtgctc 18483 
caatattact gaactatgtg tggcccaaag aatggaagag gaacagctct tgcaataggt 18543 
ctgaggagag aagctgaaga cttggactag ggcaatggta aaaactgtgg aaagaagttt 18603 
taaatgaaaa gttttaaacc atgcggcttc cagctagatg aactttttta aaaaaattag 18663 
ttcctcactc aaattttggg gaggttatat attttctaat cataaaaaat gatttttctt 18723 
atttgtgggc ttttctcccc ag atc tga acctgtggtc ttgggagcca gggtgacctg 18781
atatgacatc taaagaagct tctggactct gaacaagaat tcggtggcct gcagagcttg 18841 
ccatttgcac ttttcaaatg cctttggatg acccagcact ttaatctgaa acctgcaaca 18901 
agactagcca acacctggcc atgaaacttg ccccttcact gatctggact cacctctgga 18961
gcctatggct ttaagcaagc actactgcac tttacagaat taccccactg gatcctggac 19021
ccacagaatt ccttcaggat ccttcttgct gccagactga aagcaaaagg aattatttcc 19081
cctcaagttt tctaagtgat ttccaaaagc agaggtgtgt ggaaatttcc agtaacagaa 19141
acagatgggt tgccaataga gttatttttt atctatagct tcctctgggt actagaagag 19201
gctattgaga ctatga 19217



<210> 19 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 19 

Met Ile Phe Leu Leu Leu Met Leu Ser Leu Glu Leu Gln Leu His Gln
1 5 10 15

Ile Ala



<210> 20 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Ala Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu His
1 5 10 15

Gly Ser Asn Val Thr Leu Glu Cys Asn Phe Asp Thr Gly Ser His Val
20 25 30

Asn Leu Gly Ala Ile Thr Ala Ser Leu Gln Lys Val Glu Asn Asp Thr
35 40 45

Ser Pro His Arg Glu Arg Ala Thr Leu Leu Glu Glu Gln Leu Pro Leu
50 55 60

Gly Lys Ala Ser Phe His Ile Pro Gln Val Gln Val Arg Asp Glu Gly
65 70 75 80

Gln Tyr Gln Cys Ile Ile Ile Tyr Gly Val Ala Trp Asp Tyr Lys Tyr
85 90 95

Leu Thr Leu Lys Val Lys
100



<210> 21 
<211> 91 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Ala Ser Tyr Arg Lys Ile Asn Thr His Ile Leu Lys Val Pro Glu Thr
1 5 10 15

Asp Glu Val Glu Leu Thr Cys Gln Ala Thr Gly Tyr Pro Leu Ala Glu
20 25 30

Val Ser Trp Pro Asn Val Ser Val Pro Ala Asn Thr Ser His Ser Arg
35 40 45

Thr Pro Glu Gly Leu Tyr Gln Val Thr Ser Val Leu Arg Leu Lys Pro
50 55 60

Pro Pro Gly Arg Asn Phe Ser Cys Val Phe Trp Asn Thr His Val Arg
65 70 75 80

Glu Leu Thr Leu Ala Ser Ile Asp Leu Gln Ser
85 90



<210> 22 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Gln Met Glu Pro Arg Thr His Pro Thr Trp Leu Leu His Ile Phe Ile
1 5 10 15

Pro Ser Cys Ile Ile Ala Phe Ile Phe Ile Ala Thr Val Ile Ala Leu
20 25 30

Arg Lys Gln Leu Cys Gln Lys Leu Tyr Ser Ser Lys
35 40



<210> 23 
<211> 17 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Asp Thr Thr Lys Arg Pro Val Thr Thr Thr Lys Arg Glu Val Asn Ser 
1 5 10 15

Ala 



<210> 24 
<211> 11 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 24 

Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10



<210> 25 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Internalizing 
domain derived from HIV tat protein 

<400> 25 

Gly Gly Gly Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10 15



<210> 26 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2515-27 

<400> 26 

cataatagag catggcagca atgtgac 

<210> 27 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2524-63 

<400> 27 

gggtcctgga gtggctggtg ttg 

<210> 28 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 28 

Leu Phe Thr Val Thr Val Pro Lys Glu Leu Tyr Ile Ile Glu
1 5 10



<210> 29 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2515-24 



<400> 29 

gtggctcttt cacggtgtgg ggatg 



<210> 30 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide Primer 2538-68 

<400> 30 

ccagtgtcaa agttgcattc cagggt 